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1 Introduction 

Phase transitions are one of the most fascinating, and also most perplexing, phe- 
nomena in equilibrium statistical mechanics. On the physics side, many approx- 
imate methods to explain or otherwise justify phase transitions are known but a 
complete mathematical understanding is available only in a handful of simplest of 
all cases. One set of tractable systems consists of the so called lattice spin models. 
Originally, these came to existence as simplified versions of (somewhat more real- 
istic) models of crystalline materials in solid state physics but their versatile nature 
earned them a life of their own in many other disciplines where complex systems 
are of interest. 

The present set of notes describes one successful mathematical approach to phase 
transitions in lattice spin models which is based on the technique of reflection posi- 
tivity. This technique was developed in the late 1970s in the groundbreaking works 
of F. Dyson, J. Frohlich, R. Israel, E. Lieb, B. Simon and T. Spencer who used it to es- 
tablish phase transitions in a host of physically-interesting classical and quantum 
lattice spin models; most notably, the classical Heisenberg ferromagnet and the 
quantum XY model and Heisenberg antiferromagnet. Other powerful techniques 

— e.g., Pirogov-Sinai theory, lace expansion or multiscale analysis in field theory 

— are available at present that can serve a similar purpose in related contexts, but 
we will leave their review to experts in those areas. 

The most attractive feature of reflection positivity — especially, compared to the 
alternative techniques — is the simplicity of the resulting proofs. There are gener- 
ally two types of arguments one can use: The first one is to derive the so called 
infrared bound, which states in quantitative terms that the fluctuations of the spin 
variables are dominated by those of a lattice Gaussian free field. In systems with an 
internal symmetry, this yields a proof of a symmetry-breaking phase transition by 
way of a spin-condensation argument. Another route goes via the so called chess- 
board estimates, which allow one to implement a Peierls-type argument regardless 
of whether the model exhibits an internal symmetry or not. 
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Avid users of the alternative techniques are often quick to point out that the sim- 
plicity of proofs has its price: As a rather restrictive condition, reflection positivity 
applies only to a small (in a well defined sense) class of systems. Fortunately for 
the technique and mathematical physics in general, the models to which it does 
apply constitute a large portion of what is interesting for physics, and to physi- 
cists. Thus, unless one is exclusively after universal statements — i.e., those robust 
under rather arbitrary perturbations — the route via reflection positivity is often 
fairly satisfactory. 

The spectacular success of reflection positivity from the late 1970s was followed 
by many interesting developments. For instance, in various joint collaborations, 
R. Dobrushin, R. Kotecky and S. Shlosman showed how chessboard estimates can 
be used to prove a phase transition in a class of systems with naturally-defined 
ordered and disordered components; most prominently, the ^-state Potts model 
for q 3> 1. Another neat application came in the papers of M. Aizenman from 
early 1980s in which he combined the infrared bound with his random-current rep- 
resentation to conclude mean-field critical behavior in the nearest-neighbor Ising 
ferromagnet above 4 dimensions. Yet another example is the proof, by L. Chayes, 
R. Kotecky and S. Shlosman, that the Fisher-renormalization scheme in annealed- 
diluted systems may be substituted by the emergence of an intermediate phase. 

These notes discuss also more recent results where their author had a chance to 
contribute to the field. The common ground for some of these is the use of reflection 
positivity to provide mathematical justification of "well-known" conclusions from 
physics folklore. For instance, in papers by N. Crawford, L. Chayes and the present 
author, the infrared bound was shown to imply that, once a model undergoes a 
field or energy driven first-order transition in mean-field theory, a similar transi- 
tion will occur in the lattice model provided the spatial dimension is sufficiently 
high or the interaction is sufficiently spread-out (but still reflection positive). An- 
other result — due to L. Chayes, S. Starr and the present author — asserts that if a 
reflection-positive quantum spin system undergoes a phase transition at interme- 
diate temperatures in its classical limit, a similar transition occurs in the quantum 
system provided the magnitude of the quantum spin is sufficiently large. 

There have also been recent cases where reflection positivity brought a defi- 
nite end to a controversy that physics arguments were not able to resolve. One 
instance concerned certain non-linear vector and liquid-crystal models; it was de- 
bated whether a transition can occur already in 2 dimensions. This was settled 
in recent work of A. van Enter and S. Shlosman. Another instance involved spin 
systems whose (infinite) set of ground states had a much larger set of symmetries 
than the Hamiltonian of the model; two competing physics reasonings argued for, 
and against, the survival of these states at low temperatures. Here, in papers of 
L. Chayes, S. Kivelson, Z. Nussinov and the present author, spin-wave free en- 
ergy calculations were combined with chessboard estimates to construct a rigorous 
proof of phase coexistence of only a finite number of low-temperature states. 
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These recent activities show that the full potential of reflection positivity may 
not yet have been fully exhausted and that the technique will continue to play an 
important role in mathematical statistical mechanics. It is hoped that the present 
text will help newcomers to this field learn the essentials of the subject before the 
need arises to plow through the research papers where the original derivations 
first appeared. 

Organization 

This text began as class notes for nine hours of lectures on reflection positivity 
at the 2006 Prague School and gradually grew into a survey of (part of) this re- 
search area. The presentation opens with a review of basic facts about lattice spin 
models and then proceeds to study two applications of the infrared bound: a spin- 
condensation argument and a link to mean-field theory. These are followed by the 
classical derivation of the infrared bound from reflection positivity. The remainder 
of the notes is spent on applications of a by-product of this derivation, the chess- 
board estimate, to proofs of phase coexistence. 

The emphasis of the notes is on a pedagogical introduction to reflection posi- 
tivity; for this reason we often sacrifice on generality and rather demonstrate the 
main ideas on the simplest case of interest. To compensate for the inevitable loss of 
generality, each chapter is endowed with a section "Literature remarks" where we 
attempt to list the references deemed most relevant to the topic at hand. The notes 
are closed with a short section on topics that are not covered as well as some open 
problems that the author finds worthy of some thought. 

Acknowledgments 

A review naturally draws on the work of many authors. We will do our best to 
give proper credit to their contribution in the sections "Literature remarks" that 
appear at the end of each chapter and, of course, in the list of references. I have 
personally learned a great deal of the subject from Lincoln Chayes with whom 
I have subsequently coauthored more than half-a-dozen papers some of which 
will be discussed here. During this time I also benefited from collaborations with 
N. Crawford, S. Kivelson, R. Kotecky, Z. Nussinov and S. Starr, and from various 
enlightening discussions involving A. van Enter, B. Nachtergaele and S. Shlosman. 

The text would presumably never exist were it not for Roman Kotecky's sum- 
mer school; I wish to thank him for allowing me to speak on this subject. I am in- 
debted to the participants of the school for comments during the lectures and to T. 
Bodineau, A. van Enter, E. Lieb and S. Shlosman for suggestions on the first draft of 
the notes. My presence at the school was made possible thanks to the support from 
the ESF-program "Phase Transitions and Fluctuation Phenomena for Random Dy- 
namics in Spatially Extended Systems" and from the National Science Foundation 
under the grant DMS-0505356. 
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2 Lattice spin models: Crash course 

This section prepares the ground for the rest of the course by introducing the main 
concepts from the theory of Gibbs measures for lattice spin models. The results in- 
troduced here are selected entirely for the purpose of these notes; readers wishing 
a more comprehensive — and in-depth — treatment should consult classic text- 
books on the subject. 

2.1 Basic setup 

Let us start discussing the setup of the models to which we will direct our attention 
throughout this course. The basic ingredients are as follows: 

• Lattice: We will take the d-dimensional hypercubic lattice Z rf as our underlying 
graph. This is the graph with vertices at all points in 1R d with integer coordinates 
and edges between any nearest neighbor pair of vertices; i.e., those at Euclidean 
distance one. We will use (x,y) to denote an (unordered) nearest-neighbor pair. 

• Spins: At each x G Z rf we will consider a spin S x , by which we will mean a 
random variable taking values in a closed subset Q of W, for some v > 1. 
We will use S x ■ S y to denote a scalar product between S x and S y (Euclidean or 
otherwise). 

• Spin configurations: For A C Z d , we will refer to Sa := (S x ) xG a as the spin 
configuration in A. We will be generically interested in describing the statis- 
tical properties of such spin configurations with respect to certain (canonical) 
measures. 

• Boundary conditions: To describe the law of Sa, we will not be able to ignore that 
some spins are also outside A. We will refer to the configuration Sa^ of these 
spins as the boundary condition. The latter will usually be fixed and may often 
even be considered a parameter of the game. When both Sa and Sac are known, 
we will write 

S:=(S A ,S A c) (2.1) 

to denote their concatenation on all of Z d . 

The above setting incorporates rather varied physical contexts. The spins may be 
thought of as describing magnetic moments of atoms in a crystal, displacement 
of atoms from their equilibrium positions or even orientation of grains in nearly- 
crystalline granular materials. 

To define the dynamics of spin systems, we will need to specify the energetics. 
This is conveniently done by prescribing the Hamiltonian which is a function on 
the spin-configuration space Cl zd that tells us how much energy each spin config- 
uration has. Of course, to have all quantities well defined we need to fix a finite 
volume A C Z d and compute only the energy in A. The most general formula we 
will ever need is 
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H A (S):= £ ®a(S) (2.2) 

AcZ d finite 

where <3?a is a function that depends only on Sa- To make everything well defined, 
we require, e.g., that is translation invariant and that YIa^o II^aIIoo < 00 • (The 
infinity norm may be replaced by some other norm; in particular, should the need 
arise to talk about unbounded spins.) It is often more convenient to write the above 
as a formal sum: 

H(S):=£> A (S) (2.3) 

A 

with the above specific understanding whenever a precise definition is desired. 

The energy is not sufficient on its own to define the statistical mechanics of 
such spin systems; we also need to specify the a priori measure on the spins. This 
will be achieved by prescribing a Borel measure ]Iq on Q (which may or may not 
be finite). Before the interaction is "switched on," the spin configurations will 
be "distributed" according to the product measure, i.e., the a priori law of S A 
is (8>xeA ^o(dS T ). The full statistical-mechanical law is then given by a Gibbs mea- 
sure which (in finite volume) takes the general form e^^ H ^ Ylx Fo(dS T ); cf Sect, 
for more details. 



2.3 



2.2 Examples 

Here are a few examples of spin systems: 

(1) O(n)-modeh Here Q := S" _1 = {z e R": \z\z = 1} with jiq := surface measure 
on S" _1 . The Hamiltonian is 

H(S) := -/ £ S x • S y (2.4) 

<*,y> 

where the dot denotes the usual (Euclidean) dot-product in IR" and J > 0. (Note 
that this comes at no loss as the sign of / can be changed by reversing the spins on 
the odd sublattice of X d .) 

Note that if A S 0(n) — i.e., A is an n-dimensional orthogonal matrix — then 

AS X • ASy = S x ■ Sy (2-5) 

and so H(AS) = H(S). Since also ]Iq o A -1 = jiq, the model possesses a global 
rotation invariance — with respect to a simultaneous rotation of all spins. (For n = 1 
this reduces to the invariance under the flip +1 <-> —1.) 

Two instances of this model are known by other names: n = 2 is the rotor model 
while n = 3 is the (classical) Heisenberg ferromagnet. 

(2) Ising model: Formally, this is the 0(l)-model. Explicitly, the spin variables o~ x 
take values inQ := { — !,+!} with uniform a priori measure; the Hamiltonian is 
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H(a) :-- 



-J E 



(2.6) 



Note that the energy is smaller when the spins at nearest neighbors align and 
higher when they antialign. (A similar statement holds, of course, for all O(n) 
models.) This is due to the choice of the sign J > which makes these models 

ferromagnets. 

(3) Potts model : This is a generalization of the Ising model beyond two spin states. 
Explicitly, we fix q S IN and let o~ x take values in {1, . . . , q) (with a uniform a priori 
measure). The Hamiltonian is 



H{a) :- 



(2.7) 



so the energy is — / when o~ x and a y "align" and zero otherwise. The q = 2 case is 
the Ising model and q = 1 may be related to bond percolation on Z d (via the so 
called Fortuin-Kasteleyn representation leading to the so called random-cluster model). 

It turns out that the Hamiltonian ( |2.7| | can be brought to the form ( |2.4| |. Indeed, 
let Q denote the set of q points uniformly spread on the unit sphere in R 1 ? -1 ; we 
may think of these as the vertices of a ^-simplex (or a regular ^-hedron). The cases 
q = 2, 3, 4 are depicted in this figure: 



\1/ 





More explicitly, the elements of Q are vectors v a , a. = 1, . . . , q, such that 



v« -Vp 



I, 



l 



otherwise. 



(2.8) 



The existence of such vectors can be proved by induction on q. Clearly, if S x corre- 
sponds to a x and S y to o~ y , then 



bx by ~ q-\ d ^ q-1 
and so the Potts Hamiltonian is to within an additive constant of 

H(S) := -/ E S, • S v 



(2.9) 



(2.10) 
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with J := J^-jj-- This form, sometimes referred to as tetrahedral representation, will 
be far more useful for our purposes than ( |2.7| |. 

(4) Liquid-crystal model: There are many models that describe certain granular ma- 
terials known to many of us from digital displays: liquid crystals. A distinguished 
feature of such materials is the presence of orientational long-range order where a 
majority of the grains align with one another despite the fact that the system as 
a whole is rotationally invariant. One of the simplest models capturing this phe- 
nomenon is as follows: Consider spins S x £ S' I_1 with a uniform a priori measure. 
The Hamiltonian is 

H(S):=-J^(S x -S y ) 2 (2.11) 

The interaction features global rotation invariance and the square takes care of the 
fact that reflection of any spin does not change the energy (i.e., only the orientation 
rather than the direction of the spin matters). 

As for the Potts model, the Hamiltonian can again be brought to the form rem- 
iniscent of the 0(n)-model. Indeed, given a spin S G S" _1 with Cartesian compo- 
nents a = 1, . . . , n, define annxn matrix Q by 

Q a p := S W» - % (2.12) 

(The subtraction of the identity is rather arbitrary and more or less unnecessary; its 
goal is to achieve zero trace and thus reduce the number of independent variables 
characterizing Q to n — 1; i.e., as many degrees of freedom as S has.) As is easy to 
check, if Q <-> S and Q <-> S are related via the above formula, then 

Tr(QQ) = (S • S) 2 - i (2.13) 
Since Q is symmetric, the trace evaluates to 

Tr(QQ) =Y^QxpQ K fi (2.14) 

which is the canonical scalar product on n x n matrices. In this language the Hamil- 
tonian takes again the form we saw in the 0(n) model. 

At this point we pause to remark that all of the above Hamiltonians are of the 
following rather general form: 

H ( S ) = +^E/^l S *" S / ( 2 - 15 ) 

x,y 

where {] X y) is a collection of suitable coupling constants and | • | denotes the Eu- 
clidean norm in R". This is possible because, in all cases, the (corresponding) norm 
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of S x is constant and so adding it to the Hamiltonian has no effect on the probabil- 
ity measure. The model thus obtained bears striking similarity to our last example: 

(5) Lattice Gaussian free field: Let Q := R, Uq := Lebesgue measure and let P(x,y) be 

the transition kernel of a symmetric random walk on i.e., P(x,y) = P(0,y — 
x) = P(0,x — y). In this case we will denote the variables by <p x ; the Hamiltonian 
is 

H^)-=\L p ( x 'yM-^) 2 ( 2 - 16 ) 

x,y 

This can be rewritten as 

H(<p) = (</>,(! - P)cp) L2(zd) =: 5i_p(tf>,0) (2.17) 



where experts on harmonic analysis of Markov chains will recognize £\-p{<p,<p) 
to be the Dirichlet form associated with the generator 1 — P of the above random 
walk. In the Gibbs measure, the law of the (p x 's will be Gaussian with grad-squared 
interactions; hence the name of the model. 



The sole difference between (2.15 1 and ( 2.16| l is that, unlike the (p x 's, the spins S 



are generally confined to a subset of a Euclidean space and/ or their a priori mea- 
sure is not Lebesgue — which will ultimately mean their law is not Gaussian. One 
purpose of this course is to show how this formal similarity can nevertheless be 
exploited to provide information on the models ( 2.15[ ). 



2.3 Gibbs formalism 

Now we are ready to describe the statistical-mechanical properties of the above 
models for which we resort to the formalism of Gibbs-Boltzmann distributions. 
First we define these in finite volume: Given a finite set A C Z d and a bound- 
ary condition Sac we define the Gibbs measure in A to be the probability measure 
on Q A given by 

}iff(dS A ) := n MdS x ) (2.18) 

Here f> > is the inverse temperature — in physics terms, f> := ^ where ke 
is the Boltzmann constant and T is the temperature measured in Kelvins — 
and Za^(SaO is the normalization constant called the partition function. 
To extend this concept to infinite volume we have two options: 

(1) Consider all possible weak cluster points of the family {^ c ' } as A f Z rf (with 
the boundary condition possibly varying with A) and all convex combinations 
thereof. 

(2) Identify a distinguishing property of Gibbs measures and use it to define infi- 
nite volume objects directly. 
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While approach (1) is ultimately very useful in practical problems, option (2) is 
more elegant at this level of generality. The requisite "distinguishing property" is 
as follows: 

Lemma 2.1 (DLR condition). Let A C A C Z rf be finite sets and let S A c G Q AC . Then 
(for ^l^-a.e. S A c), 

V { lfi\S A c)=^f(-) (2.19) 

In simple terms, conditioning the Gibbs measure in A on the configuration in A\A,ive 
get the Gibbs measure in A with the corresponding boundary condition. 

This leads to: 

Definition 2.2 (DLR Gibbs measures). A probability measure on Cl z,d is called an in- 
finite volume Gibbs measure for interaction H and inverse temperature f> if for all fi- 
nite AcZ 1 * and pi-a.e. Sac, 

M-M = /£f(-) (2.20) 
where is defined using the Hamiltonian H A . 

We will use 0^ to denote the set of all infinite volume Gibbs measures at inverse 
temperature f> (assuming the model is clear from the context). 

Here are some straightforward, nonetheless important consequences of these 
definitions: 



(1) By Lemma 2.1 any weak cluster point of (^ A g ) belongs to 0^ 



(2) By the Backward Martingale Convergence Theorem, if A n f Z and u G 0^, 

(S c ) 

then for ^-a.e. spin configuration S the sequence u K \ has a weak limit, which 
then belongs to 0«. 

(3) 0^ is a convex set (and is closed in the topology of weak convergence). More- 

(S C ) w 

over, u G 0^ is extremal in (25^ iff ji A A ^ — > y. for ^/-almost every spin configu- 
ration S. 

Similarly direct is the proof of the following "continuity" property: 

(4) Let H n be a sequence of Hamiltonians converging — in the sup-norm on the 
potentials — to Hamiltonian H, and let f> n be a sequence with f5 n — > f> < oo. 
Let u n be the sequence of the corresponding Gibbs measures. Then every 
(weak) cluster point of (f/„) is an infinite-volume Gibbs measure for the Hamil- 
tonian H and inverse temperature j5. 

Note that the fact that 0^ is closed and convex ensures that each element can be 
written as a unique convex combination of extreme points (by the Krein-Millman 
theorem). The DLR condition permits to extract the corresponding decomposition 
probabilistically by conditioning on the c-algebra of tail events. 
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Now we give a meaning to the terms that are frequently (though sometimes 
vaguely) employed by physicists: 

Definition 2.3 (Phase coexistence). We say that the model is at phase coexistence (or 
undergoes a first-order phase transition) whenever the parameters are such that \ <3a | > 
1. 

The simplest example where this happens is the Ising model. Let 

A L :={l,...,L} d (2.21) 

and consider the Ising model in A L with all boundary spins set to +1. (This is the 
so called plus boundary condition.) As a consequence of stochastic domination — 
which we will not discuss here — u\ l ^ tends weakly to a measure u + as L — > oo. 

Similarly for the minus boundary condition, ^X L £ ~^ I 1 - ^ turns out that, in 
dimensions d > 2 there exists f> c {d) G (0,oo) such that 

j6 > fi c {d) => u + £ u- (2.22) 

i.e, the model is at phase coexistence, while for /S < p> c (d), the set of all infinite 
volume Gibbs measures is a singleton — which means that the model is in the 
uniqueness regime. One of our goals is to prove similar statements in all of the 
models introduced above. 

2.4 Torus measures 

In the above, we always put a boundary condition in the complement of the finite 
set A. However, it is sometimes convenient to consider other boundary conditions. 
One possibility is to ignore the existence of A c altogether — this leads to the so 
called free boundary condition. Another possibility is to wrap A into a graph with- 
out a boundary — typically a torus. This is the case of periodic or torus boundary 
conditions. 

Consider the torus TY, which we define as a graph with vertices (Z/ LZ) d , en- 
dowed with the corresponding (periodic) nearest-neighbor relation. For nearest- 
neighbor interactions, the corresponding Hamiltonian is defined easily, but some 
care is needed for interactions that can be of arbitrary range. If S G Q Tl we define 
the torus Hamiltonian Hi(S) by 

H L (S) := H Al (periodic extension of S to Z d ) (2.23) 

where we recall A L := {1, . . . , L} d . For H(S) := -\ £ zy } X/ yS x ■ S y we get 

H L (S) = -^ £ J^Sx-Sy (2.24) 

x,yeT L 
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where J™ are the periodized coupling constants 



The Gibbs measure on Q Tl is then defined accordingly: 

e -pH L (S) 

m,p(dS) : = — f[ M*Sx) (2-26) 

Z M X6T L 

where « is the torus partition function. The following holds: 
Lemma 2.4. Epery (weak) cluster point of(}ii ,b)l>i lies in ©0- 



Note that there is something to prove here because, due to ( 2.251 1, the interaction 
depends on L. 

2.5 Some thermodynamics 

Statistical mechanics combines, in its historical development, molecular theory 
with empirical thermodynamics. Many mathematically rigorous accounts of sta- 
tistical mechanics thus naturally start the exposition with the notion of the free 
energy. We will need this notion only tangentially — it suffices to think of the free 
energy as a cumulant generating function — in the proofs of phase coexistence. 
The relevant statement is as follows: 

Theorem 2.5. For x E Z d let x x be the shift-by-x defined by (t x S) v := S XJ - x .Letg: Cl zcl — 
Rbea bounded, local function — i.e., one that depends only on a finite number of spins — 
and recall that pi^ denote the torus Gibbs measures. Then: 

(1) The limit 

/ W : = r lim h lo § l ex P ( h Ls°T x )\ (2.27) 



exists for allh 6R and is convex in h. 
(2) If p. S (5^ is translation invariant, then 



3/ ^ w <- & 



dh 



h=0 



(3) There exist translation-invariant, ergodic measures p ± G (25 ^ such that 

9/ 



h=0 



(2.28) 



(2.29) 
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Proof of (1), main idea. For compact state-spaces and absolutely-summable interac- 
tions, the existence of the limit follows by standard subadditivity arguments. In 
fact, the limit will exist and will be the same even if we replace in ( |2.27| | by 
any sequence of Gibbs measures in Ai with (even L-dependent) boundary condi- 
tions. The convexity of / is a consequence of the Holder inequality applied to the 
expectation in ( 2.27| . □ 



Proof of (2). Let y. S (5^ be translation invariant and abbreviate 

Z L (h):=EJexp(h £ g°T x )\ (2.30) 
Since log Zi is convex in h (again, by Holder) we have for any h > that 



lo g Z L (fc)-logZ L (0)> J^logZ L (&) 



h 

h=0 



hE F ( E g°r x )=h\A L \E }l (g) 



(2.31) 



XGA L 



Dividing by \Ai\, passing to L — > oo and using that / is independent of the bound- 
ary condition, we get 

f(h)-f(0)>hE F (g) (2.32) 

Divide by h and let h [ to get one half of ( 2.28[ ). The other half is proved analo- 
gously. □ 

Proof of (3). Let &a,h be the set of Gibbs measures for the Hamiltonian H — 
(h/p) J^ x g ° t x . A variant of the proof of (2) shows that if }i h 6 <&p /h is translation- 
invariant, then 

w £ E ^> £ & (2 - 33) 

In particular, if h > we have 



(2.34) 

h=0 



by the monotonicity of derivatives of convex functions. Taking h J, and extracting 
a weak limit from ji^, we get a Gibbs measure }i + G such that 



(2.35) 

fc=0 



(The expectations converge because g is a local — and thus continuous, in the prod- 
uct topology — function.) Applying (2) we verify ( |2.29| l for ]i + . 

The measure }i + is translation invariant and so it remains to show that ji + can 
actually be chosen ergodic. To that end let us first prove that 
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u"7 E 8 OTx ; — > E n+(g)' in ^-probability (2.36) 

The random variables on the left are bounded by the norm of g and have expecta- 
tion E„+ (g) so it suffices to prove that the limsup is no larger than the expectation. 
However, if that were not the case, we would have 



H + { E g° T *> (V(£) + £ )I A lI >e (2.37) 

XGA L 

for some e > and some sequence of L's. But then for all h > 0, 

E pt+ \exp(h £ go Tx ) ) > ee lALlh[E >' +{8)+£] (2.38) 

^ V x£A L 7 J 

In light of the independence of the limit in (1) on the measure we use — as dis- 
cussed in the sketch of the proof of (1) — this would imply 

f(h)>h{E F+ (g) + e) (2.39) 

which cannot hold for all h > if the right-derivative of / at h = is to equal 
E fl+ {g). Hence jggg) holds. 



By the Pointwise Ergodic Theorem, the convergence in (236) actually occurs — 



and, by ( |2.36| , the limit equals E„+ (g) — for ^ + -almost every spin configuration. 
This implies that the same must be true for any measure in the decomposition of }i + 
into ergodic components. By classic theorems from Gibbs-measure theory, every 
measure in this decomposition is also in (5^ and so we can choose fi + ergodic. □ 

The above theorem is very useful for the proofs of phase coexistence. Indeed, 
one can often prove some estimates that via ( |2.28| l imply that / is not differentiable 
at h = 0. Then one applies ( 2.29| l to infer the existence of two distinct, ergodic 



Gibbs measures saturating the bounds in ( 2.28| |. Examples of this approach will be 
discussed throughout these notes. 

2.6 Literature remarks 

This section contains only the minimum necessary to understand the rest of the 
course. For a comprehensive treatment of Gibbs-measure theory, we refer to classic 
monographs by Ruelle II88II , Israel 11661 , Simon 11971 and Georgii [57|. Further gen- 
eral background on statistical mechanics of such systems can be found in Ruelle's 
"blue" book [89J. The acronym DLR derives from the initials of Dobrushin and the 
team of Lanford & Ruelle who first introduced the idea of conditional definition of 
infinite volume Gibbs measures; cf e.g. |32 | . 



The proof of Theorem 2.5 touches upon the subject of large deviation theory 
which provides a mathematical framework for many empirical principles under- 
lying classical thermodynamics. The connection of course appears in various dis- 
guises in the textbooks I|66ll97ll57| ; for expositions dealing more exclusively with 
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large-deviation theory we refer to the books by den Hollander [64 J, Dembo and 
Zeitouni \29\, and Deuschel and Stroock [30 1. For the Pointwise Ergodic Theorem 
and other facts from ergodic theory we refer to the textbooks by e.g., Krengel fl73| 
and Petersen [86|. 

Stochastic domination and the FKG inequality — dealing with partial ordering 
of spin configurations, functions thereof and thus also measures — are discussed 
in, e.g., Georgii 1157) or Grimmett [61]. The proof of ( 2.22| | can alternatively be based 
on Griffiths' correlation inequalities (Griffiths |60| ). The phase coexistence in the 
Ising model at large f> was first proved by Peierls via a contour argument that now 
bears his name (see Griffiths EH). 

Concerning the historical origin of the various model systems; the 0(n) model 
goes back to Heisenberg (who introduced its quantum version), the Ising model 
was introduced by Lenz and given to Ising as a thesis problem while the Potts 
model was introduced by Domb and given to Potts as a thesis problem. Ironically, 
the 0(l)-model bears Ising's name even though his conclusions about it were quite 
wrong! Apparently, Potts was more deserving. 

An excellent reference for mathematical physics of liquid crystals is the mono- 
graph by de Gennes and Prost l|56| ; other, more combinatorial models have been 
considered by Heilmann and Lieb l|62t and Abraham and Heilmann [lj. The tetra- 
hedral representation of the Potts model can be found in Wu's review article 111061 ; 
the matrix representation of the liquid-crystal model is an observation of Ange- 
lescu and Zagrebnov [6|. Gradient fields — of which the GFF is the simplest exam- 
ple — have enjoyed considerable attention in recent years; cf the review articles by 
Funaki 11521 , Velenik M104I and Sheffield 1192 1 . Another name for the GFF is harmonic 
crystal. 



3 Infrared bound & Spin-wave condensation 

The goal of this section is to elucidate the significance of the infrared bound — 
postponing its proof and connection with reflection positivity until Section|5] — and 
the use thereof in the proofs of symmetry breaking via the mechanism of spin-wave 
condensation. The presence, and absence, of symmetry breaking in the 0(n)-model 
with certain non-negative two-body interactions will be linked to recurrence vs 
transience of a naturally induced random walk. 

3.1 Random walk connections 

Consider the model with the Hamiltonian 

H = -\Y J 1xyS x -S y (3.1) 

x,y 
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where the spins S x are a priori independent and distributed according to a mea- 
sure pig which is supported in a compact set QcR v . Assume that the interaction 
constants satisfy the following requirements: 

(11) Jxx = and J X/V = }o,y- x 

(12) Lx\hx\ <~andE*/o,* = l 

i.e., the coupling constants are translation invariant, absolutely summable and, for 
convenience, normalized to have unit strength. We will actually always restrict our 
attention to the following specific examples: 

Nearest-neighbor interactions: 



;..,.= c " '' (3.2) 

otherwise. 



Yukawa potentials: 



with pi > and C > 0. 
Power-law decaying potentials: 



J Xiy = Ce-*\ x ~y\i (3.3) 



l>* = .. C ..- ( 3 - 4 ) 



y 



i 



with s > d and C > 0. 
On top of these, we will also permit: 

• Any convex combination of the three interactions above (with, of course, posi- 
tive coefficients). 

Note that we are using the ^-distance (rather than the more natural ^-distance). 



This is dictated by our methods of proof (see Lemma 5.5 1. Also note that the 
Yukawa potential is in the class of Kac models where the coupling constants take 
the form J™ = j d f(^y(x — y)) for some rapidly decaying function/: M rf — > [0, oo) 
with unit L 1 -norm. 

A unifying feature of all three interactions is that J™ > which allows us to in- 
terpret the coupling constants as the transition probabilities of a random walk on 
Explicitly, consider a Markov chain (X„) on Z d with 

P 2 (X„ +1 =y\X n = x) :=J xy (3.5) 

where P z is the law of the chain started at site z. Of particular interest will be the 
question whether this random walk is recurrent or transient — i.e., whether a walk 
started at the origin returns there infinitely, or only finitely many times. Here is a 
criterion to this matter: 
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Lemma 3.1. Let j(k) := Jo iX e lkx , k G [—re, n} d . Then (X n ) is transient if and only if 

dk 1 



n,nf (2n) d 1 - }(k) 



< oo (3.6) 



Proof. Recall that a random walk is transient if and only if the first return time to 
the origin, To := inf{n > 0: X„ = 0}, is infinite with a positive probability, i.e., 
Po(to < oo) < 1. By the formula E N = [1 — Po(to < oo)] -1 — where E is the 
expectation with respect to Po — we thus get that transience is equivalent EoN < 
oo. To compute the expectation, we note 

which via E e* x " = [E e ik - X ^] n = [H x h, x e ihx \ n = /(*)" implies 

P (X„ = 0) = f 7^- d m n (3.8) 
Summing over n > yields 

whereby the claim follows. (A careful proof of the latter identity requires justifica- 
tion of the exchange of the integral with the infinite sum; one has to represent the 
LHS as a power series, perform the sum and justify limits via appropriate conver- 
gence theorems.) □ 

As to the above examples, we have: 

• n.n. & Yukawa potentials: As k — > 0, 

l-j{k) ~C\k\ 2 (3.10) 

and so (X„) is transient iff d > 3. 

• Power-law potentials: Here as k — > 0, 

'\k\ s - d , ifs<d + 2, 

l-J(fc) ~ci |£| 2 log^, ifs = d + 2, (3.11) 
Jfc| 2 , ifs>d + 2. 

Hence (X n ) is transient iff d > 3 OR s < min{d + 2, 2d}. 

(Note that the walk with s < d + 2 has a stable-law tail with index of stabil- 
ity a = s — d.) A convex combination of the three coupling constants will lead 
to a transient walk provided at least one of the interactions involved therein (with 
non-zero coefficients) is transient. 
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3.2 Infrared bound 

The principal claim of this section is that the finiteness of the integral in 10.6) is 
sufficient for the existence of a symmetry-breaking phase transition in many spin 
systems of the kind ( |3.1| . The reason is the connection of the above random walk to 



the Gaussian free field ( 2.16} (GFF) with P(x,y) := ] xy . Indeed, consider the field 



in a square box A with, say, zero boundary condition. It turns out that 

Cov A (>x, <p y ) = p x( x n = V, Tac =y) =: G A {x,y) (3.12) 

n>0 

where t^c is the first exit time of the walk from A and Ga denotes the so called 
Green's function in A. In particular, we have 

Var A (4> ) = G A (0,0) (3.13) 

which, as we will see, tends to the integral ( |3.6| l as A f Z rf . Since Ea(^o) = due 
to our choice of the boundary condition, we conclude 

(Law(^o) : A C Z d ] is tight iff (X„) is transient (3.14) 

Physicists actually prefer to think of this in terms of symmetry breaking: Formally, 
the Hamiltonian of the GFF is invariant under the transformation (p x — > <p x + c, 
i.e., the model possesses a global spin-translation symmetry. The symmetry group 
is not compact and so, to define the model even in finite volume, the symmetry 
needs to be broken by boundary conditions. The existence of a limit law for <p$ 
can be interpreted as the survival of the symmetry breaking in the thermodynamic 
limit — while non-existence means that the invariance is restored in this limit. 

Our goal is to show that qualitatively the same conclusions hold also for the 
0(n)-spin system. Explicitly, we will prove: 

Theorem 3.2. Let (J xy ) be one of the 3 interactions above. Then: 
Global rotation symmetry 

Random walk driven 

of O{n)-model is broken •<===> 

by (Jxy) is transient 

at low temperatures 

We begin with the proof of the implication <=. The principal tool will be our 
next theorem which, for technical reasons, is formulated for torus boundary con- 
ditions: 



Theorem 3.3 (Infrared bound). Let L be an even integer and consider the model (3.1 1 
on torus TY with Gibbs measure ]it,9>- Suppose (J xy ) is one of the three interactions above 
and let 

c Lt p(x) := E^Sq ■ S x ) (3.15) 
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Define c Lr p(k) := £xgt l c L/ p(x)e lhx . Then 

M*) ^ TpT^W)' k e Ta {0} (3 ' 16) 

w/zere t/ is £/ze dimension of the spin vectors and T£ is the reciprocal torus, T£ := 
{¥(n 1 ,... / n d ):n i = 0,...,L-l}. 

The proof will require developing the technique of reflection positivity and is 
therefore postponed to Section [5] 

Note that Ci^(x) is the spin-spin correlation function which, in light of transla- 
tion invariance of is a function of only the spatial displacement of the two 
spins. The result has the following equivalent formulation: For all (v x ) £ C Tl 
with £ x v x = 0, 

£ v x v y E n ^S ■ S x ) < — v x v y G L {x,y) (3.17) 

x,yeT L A P x,yGT L 

where 

1 . &H x -y) 

Gdx,y):=- d £ (3.18) 

^ JteT*\{0} 1 ■* v / 

Observe that the latter is the covariance matrix of the GFF on Ti, projected on the 
set of configurations with total integral zero (i.e., on the orthogonal complement 
of constant functions). This is a meaningful object because while the cp x are not 
really well defined — due to the absence of the boundary — the differences (p y — (p x 
are. (These differences are orthogonal to constant functions, of course.) A short 
formulation of the infrared bound is thus: 



The correlation of the spins in models with one of the three interactions above 
is dominated — as a matrix on the orthogonal complement of constant functions 
in L 2 (Ti) — by the covariance of a GFF. 

This fact is often referred to as Gaussian domination. 
3.3 Spin-wave condensation in 0(n)-model 

Having temporarily dispensed with the IRB, we will continue in our original line 



of thought. Theorem 3.3 implies: 



Corollary 3.4 (Spin-wave condensation). Suppose \S X \ = 1. Then 
E 



1 2\ 

E S * ) >!-^G L (0,0) (3.19) 



L d 
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Proof. Let S\ := E xG t l S^e 1 *'* be the Fourier coefficient of the decomposition of (S x ) 
into the so called spin waves. The IRB yields 



< 



2/3 !-/(*)' 



*eT£\{0} 



On the other hand, Parseval's identity along with | S 

I 2 = L d E IS, 



1 implies 



E I 5 * 



2 = L M 



xel. 



(3.20) 



(3.21) 



The IRB makes no statement about So so we split it from the rest of the sum: 

1 .* . 



L 2dl 



Now take expectation and apply ( [3.20 (: 



1 



E 

fcGT*\{0} 



(3.22) 



E f— I 



> 1 



V 1 
27 I? 



E 



teT*\{0} 



WW 



(3.23) 



In light of j3l8) , this is pl9] | . □ 

With ( 3.19) in the hand we can apply the same reasoning as for the GFF: In 
the transient cases, Gl (0,0) converges to the integral (3^61 and so the right-hand 
side has a finite limit. By taking B sufficiently large, the limit is actually strictly 
positive. This in turn implies that the zero mode of the spin-wave decomposition 
is macroscopically populated — very much like the free Bose gas at Bose-Einstein 
condensation. Here is how we pull the corresponding conclusions from Ti onto 

Theorem 3.5 (Phase coexistence in O(n)-model). Consider the O(n)-model with n > 
1 and one of the three interactions above. Let 



5o := 



d/c 



-71,77]-' {2ny 1 - ](k) 



(3.24) 



Then for any B > Bq and any 9 G S n 1 there exists \iq G which is translation invariant 
and ergodic such that 



e, 



u e -a.s. 



(3.25) 



x<=A 



for some m± = m±(B) > 0. 
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Note that ( 3.25| l implies that the measures jig are mutually singular with respect 
to one another. Note also that /3o is finite — and the statement is not vacuous — if 
and only if the associated random walk is transient. 

Proof. Suppose, without loss of generality that we are in the transient case, i.e., 
j6o < oo. The idea of the proof is quite simple: We use ( 3.19} to show that the free 
energy is not differentiable in an appropriately-chosen external field when this 
field is set to zero. Then we apply Theorem 2.5 to conclude the existence of the 
required distinct, ergodic Gibbs measures. 
Fix 6 G S"- 1 and define 



1 



(3.26) 



(The limit exists by Theorem 2.5 ) We want to show that <J+/(0) > (and thus, by 
symmetry, gjp/(0) < 0). Corollary 3.4 yields 



- 2dl S \ 2 )>^°+o(l) 



Since | So | < L d , for any < e < 1 we have 



and so 



E^(L- 2d |5 | 2 )< e + ^(|S |>eL d ) 

l/5-/3o 



> 



+ o(l) 



(3.27) 



(3.28) 



(3.29) 



By the 0(n) symmetry of the torus measures jii «, the law of Sq/L is rotationally 
invariant with non-degenerate "radius" distribution. This implies 



l/3-/3o 



FM(0-So>4 p 



> 0^ + 0(1) 



(3.30) 



where C2 := 1 /e and, in general, C„ > is an explicitly obtainable constant. But this 
means that the exponent in 
positive probability and so 



means that the exponent in the definition of / is at least 1 £ ^° ]_, d with uniformly 



9/ 



dh~ 



> 



h=0~ 



P-fr 
4/3 



(3.31) 



Applying Theorem 2.5 for /3 > /3o and any 6 G S 1 there exists a translation invari- 
ant, ergodic Gibbs state ]Iq G such that 



Eu e {0 ■ S x ) 



dh+ 



h=0 



> 



(3.32) 



Next we need to show that the states \i$ are actually distinct. The Ergodic Theorem 
implies 
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(3.33) 



JGA L 



where G S" 1 and where m* > is the magnitude of the derivative. Note that, in 
light of ( 3.32| and the translation invariance of ye, 



m+9-9 



9/ 



dh+ 



h=0 



(3.34) 



The distinctness of y@ will follow once we prove ( |3.25[ ) r i.e., 9 = 9. (This is, of 
course, intuitively obvious because the way we constructed fig indicates that the 
law of S x under yg should be biased in the direction of 9.) 

Suppose 9^9. Find a rotation A £ 0(n) such that A9 = 9. Let p be the 
measure such that E^(/(S)) := £ f , e (f(AS)) for all local functions /. (The existence 
of such a measure follows from the Kolmogorov Extension Theorem.) Since both 
the Hamiltonian and the a priori measure are 0(n) -invariant, we have p £ ©«. But 
H3.33D impl ies Ep le {S x ) — m* 9, and so from ( |3.34| | we have 



E p (9-S x ) = E Me (9-AS x )=m+\9\ 



6^8 



dh- 



h=0 



(3.35) 



As p is a Gibbs measure, this contradicts the general bounds in Theorem 2.5 Hence, 
we must have 9 = 9 after all. □ 

The above statement and proof are formulated for the specific case of the O(n) 
model. A similar proof will apply the existence of a symmetry-breaking phase tran- 
sition at low temperatures in the Ising, Potts and the liquid-crystal models in all 
transient dimensions. As the Ising and Potts model have only a discrete set of spin 
states, a symmetry-breaking transition will occur generally in all dimensions d > 2. 
However, this has to be proved by different methods than those employed above 
(e.g., by invoking chessboard estimates). 



Our next goal is to establish the complementary part of Theorem 3.2 i.e., the 
implication =>, which asserts the absence of symmetry breaking in the recurrent 
cases. This argument predates the other direction by 20 years and bears the name 
of its discoverers: 

Theorem 3.6 (Mermin-Wagner theorem). Let n > 2 and consider the O(n)-model 
with non-negative interactions constants (J Xi y) satisfying the conditions (11,12) from 
Sect. 3.1 Suppose the corresponding random walk is recurrent. Then every y. S 0r is 



invariant under any simultaneous (i.e., homogeneous) rotation of all spins. 



Proof. We will show that the spins can be arbitrarily rotated at an arbitrary small 
cost of the total energy. (This is why we need n > 2.) We will have to work with 
mhomogeneous rotations to achieve this, so let cp x be a collection of numbers with 
{x: cp x 7^ 0} finite and let iR be a unit element of the Lie algebra o(n), i.e., e lRlx is 



Phase transitions in lattice models 23 



a rigid rotation of the unit sphere by angle a about a particular axis. Let co,p be the 
map on configuration space acting on individual spins via 

w v {S x ) := e^ R S x , x G Z d (3.36) 

To investigate the effect of such an inhomogeneous rotation on the Hamiltonian, 
note that 

co <p (S x )-co q> (S y ) = S x -e i ^-^ R S y 

= S x -S y -S x -[l-e^y-^ R }S y 
Hence the energy of a configuration in any block A D {x: cp x ^ 0} transforms as 

H A (cv 9 {S)) = H A (S) + AH (3.38) 

where 

AH := \ £ J xy S x -[1- e^y-^ R ]S y (3.39) 

x,y 

Using that AH depends only on the portion of the spin configuration in A, a simple 
application of the DLR condition shows that, for any local function /, 

E y (foa> (p ) = E H (fe-P AH ) (3.40) 

We will now let <p x — > a in a specific way that ensures AH^O; this will permit us 
to extract the desired conclusion by limiting arguments. 

First we will need to control the ^-dependence of AH, so we expand the expo- 
nential: 

AH = Y^lxy (S X ■ RSy)((py — f x ) 

X,V 

(3.41) 



4 

x,y 

In the first term we note that the self-adjointness of R — valid by the choice of iR 
as an element of the Lie algebra — implies that } xy (S x ■ RS y ) is symmetric under 
the exchange of x and y. Since (q> y — cp x ) is antisymmetric and finitely supported, 
the sum is zero. Estimating the remainder by the quadratic term, we thus get 

|AH| <CY J Jxy{<Py-q>x) 1 = 1CS 1 - ] { ( p, ( p) (3.42) 

x,y 

for some constant C < oo. Here we used that (RS X ■ RS y ) is bounded and re- 
called the definition of the Dirichlet form •) of the random walk driven by 
the (Jx,y)'s. 

Our next task will be to control the Dirichlet form under the condition that q> 
tends to a in every finite set. To that end we fix < R < oo and set 

^:=«Px(t <t A c) (3.43) 
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This function equals a. at x = 0, zero on A^> and is harmonic (with respect to the 
generator of the random walk) in Ar \ {0}. A calculation shows 



y 

Y,hy{<>i-9y) (3 ' 44) 



y 

a. v 

harmonic or y 
zero in {0} c 



But the recurrence of the associated random walk implies that (p y — > a as R — > oo 
for every y and since the J X y's are summable, the right-hand side tends to zero by 
the Dominated Convergence Theorem. Thus AH — > as R — > oo and so applying 
R -> oo to ( |3.40| ) with the choice ( |3.43| yields 

E li (foa> K ) = E F (f) (3.45) 

for every continuous local function /. Thereby we conclude that y is invariant un- 
der simultaneous rotation of all spins. □ 



3.4 Literature remarks 

The content of the entire section is very classical. The Infrared Bound (and its proof 
based on reflection positivity) was discovered in the seminal work of Frohlich, Si- 
mon and Spencer |B0| from 1976 where it was also applied to prove a phase tran- 
sition in the 0(n)-model (as well as the isotropic Heisenberg and other models). 
Dyson, Lieb and Simon |38| showed how to adapt the method to a (somewhat more 
limited) class of quantum spin models. The technique was further developed and 
its applications extended in two papers of Frohlich, Israel, Lieb and Simon 11461147) . 
Thanks to the representation ( 2.12| , the proof of a long-range order in the liquid- 



crystal model, derived by Angelescu and Zagrebnov (6|, follows the same route as 
for the 0(n) model. However, the type of long-range order that is concluded for 
the actual spin system is different. Indeed, let y be a (weak) cluster point of the 
torus states. Then 

lim £ [(So • S x ) 2 - V n ] > (3.46) 

L ^°° \ A L\ x £a l 

with a positive probability under y. (The limit exists by the Pointwise Ergodic The- 
orem.) As y is 0(n)-invariant, if S x were asymptotically independent of So for 
large x, we would expect £^(So • S x ) 2 — > 1 / n as \x\ — > oo. Apparently, this is not 
the case, the direction of S x remains heavily correlated with the direction of So for 
arbitrary x, i.e., there is an orientational long range order. 

Whether or not the 0(n) symmetry of the law of S x is broken is an open (and 
important) question. (The law of each individual S x is invariant under the flip S x <-» 
— S x and so the magnetization is zero in all states.) As noted before, other models of 
liquid crystals based on dimers on Z 2 were considered by Heilmann and Lieb [62| 
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and Abraham and Heilmann [lj prior to the work [6J. There an orientational long- 
range order was proved using chessboard estimates; the question of absence of 
complete translational ordering (i.e., breakdown of translation invariance) remained 
open. 

The Mermin-Wagner theorem goes back to 1966 l[82l . Various interesting math- 
ematical treatments and extensions followed |34[ [871 [49|; the argument presented 
here is inspired by the exposition in Simon's book |(97| . A fully probabilistic ap- 
proach to this result, discovered by Dobrushin and Shlosman [34], has the ad- 
vantage that no regularity conditions need to be posed on the spin-spin interac- 
tion provided it takes the form V(S X — S y ); cf the recent paper by Ioffe, Shlosman 
and Velenik [65|. Finally, we remark that a beautiful and more in-depth exposi- 
tion of this material — including quantum systems — was presented at the Prague 
School in 1996 by Balint Toth; his handwritten lecture notes should be available 
online Il03l . 

The basis of the Mermin-Wagner theorem, as well as its extension, is the contin- 
uum nature of the spin space. Indeed, in the Ising (and also Potts) model, a low- 
temperature symmetry breaking occurs even in some recurrent dimensions; e.g., 
in cf = 2 for the nearest-neighbor interactions. For what determines the presence 
and absence of symmetry breaking in d = 1, see the work of Aizenman, Chayes, 
Chayes and Newman [3| and references therein. 

The connection with random walk is, of course, made possible by our choice to 
work with non-negative couplings. However, most of the quantitative conclusions 
of this section hold without reference to random walks. For detailed expositions of 
the theory of random walks we recommend the monographs by Spitzer [ 100 1 and 
Lawler |76l| ; the material naturally appears in most graduate probability textbooks 
(e.g., Durrett 1551 1 

It is interesting to note that even in d = 2, the nearest-neighbor O(n) model 
exhibits a phase transition when n = 2. Namely, while the Gibbs state is unique at 
all j6 < oo, for large j5 it exhibits power law decay of correlations with /5-dependent 
exponents. This regime is (again, after its discoverers) referred to as the Kosterlitz- 
Thouless phase II70I . A rigorous treatment exits, based on renormalization theory 
and connection with Coulomb gas, thanks to the pioneering work of Frohlich and 
Spencer 15TH ; see also more recent papers by Dimock and Hurd |3"T] . This is of 
much interest in light of recent discovery of new conformally-invariant planar pro- 
cesses — the Schramm-Loewner evolution (a.k.a. SLE). No such phenomenon is 
expected when n > 3 though there is a minor opposition to this (e.g., Patrasciou 
and Seiler E3). 

4 Infrared bound & Mean-field theory 

In this chapter we will discuss how the infrared bound can be used to control the 
error in so-called mean-field approximation. Unlike the spin-wave condensation, 
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which is concerned primarily with the infrared — i.e., small-A: or large spatial scale 
— content of the IRB, here will make the predominant use of the finite-A: — i.e., 



short range — part of the IRB. (Notwithstanding, the finiteness of the integral (3.6 1 
is still a prerequisite.) 

4.1 Mean-field theory 

Mean-field theory is a versatile approximation technique frequently used by physi- 
cists to analyze realistic physical models. We begin by a simple derivation that 
underscores the strengths, and the shortcomings, of this approach. 

Consider a lattice spin model with the usual Hamiltonian ( |3.1) . Pick a transla- 
tion invariant Gibbs measure ]i £ ®6 consider the expectation of the spin at 
the origin. The conditional definition of Gibbs measures (the DLR condition) al- 
lows us to compute this expectation by first conditioning on all spins outside the 
origin. Indeed, the one-spin Gibbs measure is determined by the (one-spin) Hamil- 
tonian 

%} (S) = - E fa So • S x = -So • £ Jo, x S x = -So ■ Mo (4.1) 

X X 

where we introduced the shorthand 

M :=X>,*S* (4.2) 



Thus, by the DLR, 



EJS ) = eJ ?V_, ' (4.3) 



where, abusing the notation slightly, the "inner " expectations are only over So — 
Mo acts as a constant here — and the outer expectation is over the spins in Z rf \ {0}, 
and thus over Mo- 

So far the derivation has been completely rigorous but now comes an ad hoc 
step: We suppose that the random variable Mo is strongly concentrated about its 
average so that we can replace it by this average. Denoting 

m:=E ¥ {S ) (4.4) 

we thus get that m should be an approximate solution to 

E,o(e^) ( } 



This is the so called mean-field equation for the magnetization. 
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Besides the unjustified step in the derivation, a serious practical problem with 
( |4.5| l is that it often has multiple solutions. Indeed, for the set of points (/3,m) that 
obey this equation, one typically gets a picture like this: 



/// 



Here, for j5 < fio, the only solution is m = — this is always a solution whenever 
E^g (S) =0 — but at f> = /So, two new branches appear and coexist over an interval 
of j6's. It is clear that as f> varies, the "physical" solution must undergo some sort of 
jump, but it is not possible to tell where this jump occurs on the basis of equation 
( |4.5| l alone. For that one has to go beyond the heuristic derivation presented above. 

As is standard, one comes up with an additional "selection" principle that de- 
termines which solution is "physical." At the level of classical thermodynamics, 
this is done by postulating that the solution must minimize an appropriate free 
energy function. In the choice of this function we will be guided by the fact that 
there is a proper statistical-mechanical system for which the above derivations can 
be explicitly validated by way of large-deviation theory. This system is the corre- 
sponding model on the complete graph. 

Consider a graph on N vertices with each pair of vertices joined by an undi- 
rected edge. At each vertex x = 1, . . . ,N we have a spin S x with i.i.d. a priori law jiQ. 
Each spin interacts with every other spin; the interaction Hamiltonian is given by 

i N 

H N (S):=-— £ S x -S y (4.6) 

x,y=\ 

The normalization by 1 /n ensures that the energy grows proportionally to N; 
the "2" in the denominator compensates for counting each pair of spins twice. 

To derive the formula for the free energy function, consider first the cumulant 
generating function of the measure }Iq, 

G(h) :=log£ Fo (e /! S ), h£R v (4.7) 

Its Legendre transform, 



y(m) := inf \G(h) - h ■ ml (4.8) 
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defines the entropy which, according to Cramer's theorem, is the rate of large- 
deviation decay in 

Ho ( E S * ~ mN ) = e- Ny '^ + °W (4.9) 

(The function is infinite outside Conv(Q), the convex hull of Q and the set of possi- 
ble values of the magnetization.) Next we inject the energy into the mix and look at 
the Gibbs measure. To describe what configurations dominate the partition func- 
tion, and thus the Gibbs measure, we identify the decay rate of the probability 

^o(e^ E ^ s - s n {ExS ^ mN} ) = e-^W+»(«) (4.10) 

Here the rate function 

®p(m) := -^\m\ 2 -y(m) (4.11) 

is the desired mean-field free-energy function, free energy The physical solutions are 
clearly obtained as the absolute minima of m i— > <f?a(m). This is actually completely 
consistent with (14 .5b: 



Lemma 4.1. We have 

V<^(m)=0 m = VG(pm) (4.12) 



Explicitly, the solutions to ( |4.5[ ) are in bijection with the extreme points ofm \— > <E>fl(m). 

Proof. This is a simple exercise on the Legendre transform. First we note that 
VO^(ffl) = is equivalent to /3m = — VJ^(wi). The convexity of G implies 
that there is a unique h m such that y(m) = G(h m ) — m ■ h m . Furthermore, h m 
depends smoothly on m and we have VG(/i m ) = m. It is easy to check that 
then Vy(m) = —h m . Putting this together with our previous observations, we 
get that 

VO /J (m)=0 fim = h m m = VG(f>m) (4.13) 

It remains to observe that m = VG(j6m) is a concise way to write ( |4.5| >. □ 



Lemma 4.1 shows that the appearance of multiple solutions to ( |4.5| l coincides 
with the emergence of secondary local maxima/ minima. 



4.2 Example: the Potts model 

It is worthwhile to demonstrate the above general formalism on the explicit ex- 
ample of the Potts model. We will work with the tetrahedral representation, i.e., 
on the spin space Q := . . . , v ? }. The mean-field free energy function is best 
expressed in the parametrization using the mole fractions, x\, . . . , Xq, which on the 
complete graph represent the fractions of all vertices with spins pointing in the 
directions v\, . . ., v ?/ respectively. Clearly, 
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i=i 



The corresponding magnetization vector is 



m 



xivi H hi 9 v 



qvq. 



In this notation we have 



E 

fc=i 



Xjt + *k logXfc 



(4.14) 



(4.15) 



(4.16) 



It is not surprising, but somewhat non-trivial to prove (see [9. Lemma 4.4]) that 
all interesting behavior of occurs "on-axes" that is, the absolute minimizers — 
and, in fact, all local extrema — of occur in the directions of one of the spin 
states. (Which direction we choose is immaterial as they are related by symmetry.) 
The following picture shows the qualitative look of the function m i— > <&«(tnvi) at 
four increasing values of j6: 





Here the function first starts convex and, as j8 increases, develops a secondary local 
minimum (plus an inevitable local maximum). For /3 even larger, the secondary 
minimum becomes degenerate with the one at m = and eventually takes over 
the role of the global minimum. With these new distinctions, the plot of solutions 
to the mean-field equation for the magnetization becomes: 



m 



local max 



local min 




Pt P 

Note that the local maximum eventually merges with the local minimum at zero 
— at which point zero becomes a local maximum. The jump in the position of the 
global minimum occurs at some /3 t , which is strictly larger than the point jSo where 
the secondary minima/ maxima first appear. 
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4.3 Approximation theorem & applications 

The goal of this section is to show that, with the help of the IRB, the conclusions 
of mean-field theory can be given a quantitative form. Throughout we restrict our- 
selves to interactions of the form and the coupling constants being one of the 
3 types above. 

Definition 4.2. We say that a measure \i S is a torus state if it is either a (weak) 
cluster point of measures yi^a or can be obtained from such cluster points by perturbing 
either [5 or uq or the inner product between spins. 

The reason for the second half of this definition is that the "operations" thus 
specified preserve the validity of the IRB. For such states we prove: 

Theorem 4.3. Suppose \S X \ < 1. Let u S &pbe a translation-invariant, ergodic, torus 
state and define 

m+ := E F {S ). (4.17) 
Let be the mean-field free energy function corresponding to this model. Then 

<S>fs{m*)< inf ^( m ) + Ylx i (4.18) 

meConv(fi) I 



where 



T r =l ^JWL (419) 



Note that the integral is finite iff the random walk corresponding to (Jxy) is 
transient. However, unlike for Green's function, represents the expected number 
of returns back to the origin after the walk has left the origin. Thus, in strongly 
transient situations one should expect that Id is fairly small. And, indeed, we have 
the following asymptotics: 

• n.n. interactions: 

Zd~~, d^oo. (4.20) 



Yukawa potentials: If d > 3, 



Id < Cfi d . (4.21) 



• Power-law potentials: l£d>3 OR s < mm{d + 2, 2d}, 

I d <C{s-d). (4.22) 

Of course, one is able to make the integral small for interactions with power law 
tails even when s is not too close to d: Just take a mixture of Yukawa and power-law 
with positive coefficients and let }i be sufficiently small. Within the class of above 



models, we can rephrase Theorem 4.3 as: 
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Physical magnetizations nearly minimize 
the mean-field free energy function 

This is justified because, as it turns out, all relevant magnetizations can be 
achieved in ergodic torus states. Let us again demonstrate the conclusion on the 
example of the ^-state Potts model: 

Theorem 4.4. Let q > 3 and suppose that <C 1 / Cj . Then there is fit € (0, oo) and 
translation-invariant, ergodic measures vq, v\,...v q S <5p t such that 



and 



\E Vo {S x ) \ < 1 



E Vj (S x ) = m+Vj, j = l,...,q, 



(4.23) 



(4.24) 



where m* > V2. In particular, the 3-state Potts model undergoes a first-order phase tran- 
sition provided the spatial dimension is sufficiently large. 



This result is pretty much the consequence of the pictures in Sect. 4.1 Indeed, 
including the error bound ( |4.18 l, the physical magnetization is confined to the 
shaded regions: 




Thus, once the error is smaller than the "hump" separating the two local minima, 
there is no way that the physical magnetization can change continuously as the 
temperature varies. This is seen even more dramatically once we mark directly into 
the mean-field magnetization plot the set of values of the magnetization allowed 



by the inequality ( |4.18[ ): 



r 
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(To emphasize the effect, the plots are done for the q = 10 state Potts model rather 
than the most interesting case of q = 3.) Notice that the transition is bound to occur 
rather sharply and very near the mean-field value of /3 t ; explicit error bounds can 
be derived, but there is no need to state them here. 

An additional argument is actually needed to provide a full proof of (424). In- 



deed, we claim that the symmetry breaking happens exactly in the direction of one 
of the spin states while the approximation by mean-field theory only guarantees 
that the expectation is near one of these directions. 

Proof of ( 4.24| , sketch. Consider an ergodic Gibbs state y. with := £^(S X ) ^ 



at inverse temperature /3. Given a sample a = (o~ x ) from ji, at each unordered 
pair (x, y) of vertices from Z d let 

Vxy ■= l Wx=(ry} Z xy (4.25) 

where (Z xy ) are a priori independent, zero-one valued random variables with 

F(Z xy = 1) = 1 - F{Z xy = 0) := 1 - e"M* (4.26) 

This defines a coupling of u with a random cluster measure — the distribution of 
the n's — which, by the fact that the extension comes from i.i.d. random variables, 
is also ergodic. 

When m* ^ 0, the //-marginal features a unique infinite connected component 
of edges (x, y) with n xy = 1 whose (site) density is proportional to | m* \ . By the con- 
struction, the spin variables take a (constant) value on each connected component, 
which is a.s. unique (by ergodicity) on the infinite one and uniform on the finite 
ones. Thus, the bias of the spin distribution comes only from the infinite compo- 
nent and so it points in one of the q spin directions. The claim thus follows. □ 



4.4 Ideas from the proofs 

A fundamental technical ingredient of the proof is again provided by the IRB, so 
throughout we will assume one of the three interactions discussed above. How- 
ever, we will need the following enhanced version: 

Lemma 4.5 (IRB enhanced). Suppose the random walk driven by the (J X y) is tran- 
sient and let G(x,y) denote the corresponding Green's function on Z d . Let p. S &p be 
a translation-invariant, ergodic, torus state and let us denote m* := Ep t (So). Then for 
all {v x ) xeX d G C z<i with finite support, 



Y J v x v y E }l {{S x -m i< ) ■ (Sy-m*)) < ^J^v x v y G(x,y). 

x,y Z P x,y 



(4.27) 
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Proof. The IRB on torus survives weak limits and so we know that, for every (w x ) 
with finite support and Y, x w x = 0/ 



Y^w x WyE F (S x ■ S y ) < — Yj w xWyG{x,y) 

x,y L P x,y 



(4.28) 



G(x,y) := lim G L (x,y) 

L^oo 



(4.29) 



where 

dk e ih{x-y) 

-71,71]' ( lTl ) d 1 - W) 

What separates (428) from ( 4.27| l are the m* terms in the expectation on the left 
and the absence of the restriction on the sum of v x . The former is remedied easily; 
indeed, the restriction Yax w * = allows us to put the terms at no additional 
cost. 

To address the latter issue, suppose (v x ) has finite support but let now Yi,x v x 
be arbitrary. Let A[ C Z 1 * contain the support of (v x ). To convert to the previous 
argument, let 



A, 



E^ 



and 



w x := v x -a L l AL (x) 

Note that £ x w x = 0. Then 

Y J ^>xWyE }t ((S x - nu) ■ (Sy - m*)) = Y^v x v y E }l ((S x - 



(4.30) 
(4.31) 



[Sy-m±)) 



x,y 



2E h 



«l ^ (Sj-m*) • J^Vy{Sy-m+) 

xga l y 



+ E U 



«L E ( Sx ~ m *) 

XGA L 



(4.32) 



But ergodicity of }i implies that 

1 



A, 



E ( s *- m *) 



XGA L 



L^oo 



(4.33) 



and so, by Cauchy-Schwarz, the last two terms in ( 4.32| converge to zero as L — > oo. 
Now apply ( |4.28| l and pass to the limit L — > oo there. A direct calculation (and the 
Riemann-Lebesgue lemma) shows that 



- — r G(x,y) - 



(4.34) 



and so the terms involving on the right-hand side of ( 4.27| suffer a similar fate. 
This means that the left-hand sides of 1 4.27 - 428) tend to each other, and same for 
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the right-hand sides. The desired bound (4.27| is thus a limiting version of (428). 

□ 

Clearly, the restriction to finitely-supported (v x ) is not necessary; instead, one 
can consider completions of this set in various reasonable norms. The above for- 
mulation has an immediate, but rather fundamental, consequence: 

Corollary 4.6 (Key estimate). Let pi G 0« be an ergodic torus state and let m* := 
Eu(S x ). Then we have 



Jo,x 



2X y 
- if, d 



(4.35) 



Proof. Choose v x '■= Jo, x and note that with this choice the left-hand side of ( 4.27[ ) 
becomes the left-hand side of ( 4.35| |. As to the right-hand side of ( 4.27| |, we get 



dk e ik < x -^ 



-T , 



h,xh r y 



(4.36) 



Recalling the definition of }(k), this yields the desired error term. □ 

This corollary provides a justification of the ad hoc step in the derivation of 
mean-field theory: Indeed, once 1^ is small, the variance of Mo is small and so Mo 
is with high probability close to its average. 

The rest of the proof of Theorem 4.3 is based on inequalities linking the mean- 
field free energy with the actual magnetization of the system; this part of the proof 
works for general non-negative coupling constants satisfying conditions (11-12) 
from Sect. 13.11 The relevant observations are as follows: 



Proposition 4.7. Let pi G <3pbe translation invariant and let m* := E^(S X ). 
(l)We have 



Oa(w*)< inf <I>«(m) + £ V / 0< x[E H (S • S x ) 

iwGConv(fl) 2. 



(4.37) 



(2) Suppose also Jo,* > and \S X \ < 1. Then 

£ Jo,x [MSo ■ Sx) - |m*| 2 ] < f> E }t ( £ Jo,* S x 



(4.38) 



Proof of (1). The proof is based on convexity inequalities linking the mean-field free 
energy and the characteristics of the actual system. Fix A C Z d and let Za be the 
partition function in A. A standard example of such convexity inequality is 

Z A >exp{-|A| inf 0«(m) + 0(3A)). (4.39) 

meConv(n) J 
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To prove this we pick m in the (relative) interior of Conv(Q) and define a tilted 
measure 

F h (dS) := e h S - G W}i (dS) (4.40) 

with h adjusted so that Eu h (S) = m. (Such h exists for each m in the relative interior 
of Conv(Q), by standard arguments for the Legendre transform.) We then get 

Z A = E mih ( e -PH A (S)-fc.M A +|A|G(ZO) (4 41) 

where we introduced the shorthand 

M A := X] (4.42) 

xeA 

Now apply Jensen to get the expectation into the exponent; the product nature 
of implies that E® m (H A (S)) = -|A|||m| 2 + 0(9A) and so ( |4.39[ ) follows by 
noting that G(h) — h ■ m = y(m) due to our choice of h, and subsequently opti- 
mizing over all admissible m. 

Now fix a general h S W and let ]i be a Gibbs measure as specified in the claim. 
First we note that the DLR condition implies 

E ^ e +PH A+ h.M AZA) = E Wo (e h - M -) = e^ G ^ (4.43) 
The Z A term can be bounded away via ( |4.39[ ); Jensen's inequality then gives 

jS E H (H A ) + |A| h ■ m* - |A| inf <E>fi(m) + 0(3A) < |A|G(/z) (4.44) 

meConv(n) 

Next, translation invariance of ^ yields 

E H (H A ) = -|A|^/o,xE^(S -S. T ) + 0(9A) (4.45) 

X 

and so dividing by A and taking A f Z rf along cubes gets us 

-^X>,x£,,( S o-S*) - inf ^(m) < G(fc) - /i • m* (4.46) 

^ x meConv(n) 

Optimizing over /i turns the right-hand side into y(m+). Adding 2l m *| 2 on both 
sides and invoking ( 4.11| | now proves the claim. □ 

Proof of (2). Let us return to the notation Mo := Y^xh,x$x- The left-hand side of 
( 4.38 1 can then be written as E f( (So • Mo) — |w2*| 2 . Since /o,o = 0, an application of 
the DLR condition yields 

£^(M • S ) = E fl (Mo ■ VG( J 6M )) (4.47) 

The DLR condition also implies 
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= E pt {M ) = E n [VG(pM )] (4.48) 

and so we have 

E M (S ■ M ) - \m*\ 2 

= E /< ((Mo-m*)- (VG(0M o )-VG(0m*))) (4.49) 

But |S X | < 1 implies that the Hessian of G is dominated by the identity, VVG(m) < 
id at any m £ Conv(Q) — assuming J x « > — and so 

(M - m*) • (VG(^Mo) - VG(/5m*)) < 0|M O - m*\ 2 (4.50) 

by the Mean- Value Theorem. Taking expectations proves ( |4.38| |. □ 

Theorem 4.3 now follows by combining Proposition 4.7 with Corollary |4.6| In- 



terestingly, ( 4.37| gives 

£ hxE fl {S -S x ) > \m*\ 2 (4.51) 



i.e., the actual energy density always exceeds the mean-field energy density. 
4.5 Literature remarks 

The inception of mean-field theory goes back to Curie (28 1 and Weiss |105|. One 
of the early connections to the models on the complete graph appears in Ellis' 
textbook on large-deviation theory H40l . Most of this section is based on the papers 
of Biskup and Chayes [9] and Biskup, Chayes and Crawford ffT0| . The Key Estimate 
had been used before in some specific cases; e.g., for the Ising model in the paper 
by Bricmont, Kesten, Lebowitz and Schonmann [22 J and for the ^-state Potts model 
in the paper by Kesten and Schonmann [69|. Both these works deal with the limit 
of the magnetization as d — > oo; notwithstanding, no conclusions were extracted 
for the presence of first-order phase transitions in finite-dimensional systems. 

The first-order phase transition in the ^-state Potts model has first been proved 
by Kotecky and Shlosman [69 j but the technique works only for extremely large q. 
The case of small q has been open. The upshot of the present technique is that it re- 
places q by d or interaction range in its role of a "large parameter." The price to pay 
is the lack of explicit control over symmetry: We expect that the measure v$ in The- 
orem 4.4 is actually "disordered" and E Vo (S x ) = 0. This would follow if we knew 
that the magnetization in the Potts model can be discontinuous only at the perco- 
lation threshold — for the Ising model this was recently proved by Bodineau fl9| 
— but this is so far known only in d = 2 (or for q very large). The coupling in the 
proof of ( 4. 24) is due to Edwards and Sokal [39 J; for further properties see Grim- 



mett [61 j or Biskup, Borgs, Chayes and Kotecky f8]. The uniqueness of the infinite 
connected component is well known in the nearest-neighbor case from a beautiful 
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argument of Burton and Keane f23); for the long-range models it has to be supplied 
by a percolation bound dominating the number of edges connecting a box of side L 
to its complement. 

The requirement <C 1 /q is actually an embarrassment of the theory as the 
transition should become more pronounced, and thus easier to control, with in- 
creasing q. Thus, even for nearest-neighbor case, we still do not have a dimension 
in which all q > 3 state Potts models go first order. (It is expected that this happens 
already in d = 3.) The restriction to transient dimensions is actually not absolutely 
necessary; cf recent work Chayes p4|. 

It is natural to ask whether one can say anything about the continuum-^ ex- 
tension of the Potts model, the random cluster model; see Grimmett |6T1 . Unfor- 
tunately, the main condition for proving the IRB, reflection positivity, holds if and 
only if q is integer (Biskup 0). 

Another model for which this method yields a novel result is the liquid-crystal 
model discussed in Sect. 2.2 Here Angelescu and Zagrebnov [6| proved that sym- 
metry breaking (for the order parameter max^, E^[S^] 2 — V„) occurs at low temper- 
atures by exhibiting spin-wave condensation; cf remarks at the end of Chapter [3] 
In [9 1 it has been shown that, for n > 3, the order parameter undergoes a discontin- 
uous transition at intermediate temperatures; van Enter and Shlosman |4lll42| later 
proved such transitions in highly non-linear cases. Similar "mean-field driven" 
first order phase transitions have also been proved for the cubic model |3 and the 
Blume-Capel model Iflul . 

Once the general theory is in place, the proof of a phase transition for a specific 
model boils down to the analysis of the mean-field free energy function. While in 
principle always doable, in practice this may be quite a challenge even in some 
relatively simple examples. See, e.g., |0 Sect. 4.4] what this requires in the context 
of the liquid-crystal model. 

Finally, we note that the IRB has been connected to mean-field theory before; 
namely, in the work of Aizenman [2] (cf also Frohlich 114511 and Sokal [98 1) in the 
context of lattice field theories and that of Aizenman and Fernandez in the context 
of Ising systems in either high spatial dimensions [4j or for spread-out interac- 
tions [5]. A representative result from these papers is that the critical exponents in 
the Ising model take mean-field values above 4 dimensions. The IRB enters as a tool to de- 
rive a one-way bound on the critical exponents. Unfortunately, the full conclusions 
are restricted to interactions that are reflection positive; a non-trivial extension was 
obtained recently by Sakai 11901 who proved the IRB — and the corresponding con- 
clusions about the critical exponents — directly via a version of the lace expansion. 



5 Reflection positivity 



In the last two sections we have made extensive use of the infrared bound. Now 
is the time to prove it. This will require introducing the technique of reflection 
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positivity which, somewhat undesirably, links long-range correlation properties 
of the spin models under consideration to the explicit structure of the underlying 
graph. Apart from the infrared bound, reflection positivity yields also the so called 
chessboard estimate which we will use extensively in Chapter |6l 



5.1 Reflection positive measures 

We begin by introducing the basic setup for the definition of reflection positivity: 
Consider the torus TY of side L with L even. The torus has a natural reflection sym- 
metry along planes orthogonal to one of the lattice directions. (For that purpose we 
may think of Ti as embedded into a continuum torus.) The corresponding "plane 
of reflection" P has two components, one at the "front" of the torus and the other 
at the "back." The plane either passes through the sites of TY or bisects bonds; we 
speak of reflections through sites or through bonds, respectively. The plane splits the 
torus into two halves, and T7, which are disjoint for reflections through bonds 
and obey n = P for reflections through sites. 

Let 2l ± denote the set of all functions /: Q Tl — > K that depend only on the 
spins in TjK Let i? denote the reflection operator, d: 2l ± — > 21^, which acts on spins 
via 

*(Sx) := S m (5.1) 
Clearly, i? is a morphism of algebra 2l + onto 2l~ and # 2 = id. 

Definition 5.1 (Reflection positivity). A measure u on Q Tl is reflection positive 

(RP) with respect to d if 

(1) Forallf,g G &+, 

E v {f$g) = E }l {gdf) (5.2) 

(2) For all f G 21+, 

E,(/tf/)>0 (5.3) 
Note that the above implies that f,g i— > E fi (f dg) is a positive-semindefinite 



symmetric bilinear form. Condition (5.2 1 is usually automatically true — it requires 
only !?-invariance of u — so it is the second condition that makes this concept non- 
trivial (hence also the name). Here we first note that the concept is not entirely 
vacuous: 

Lemma 5.2. The product measure, u = ®Uq, is RP with respect to all reflections. 

Proof. First consider reflections through bonds. Let/,g € Since n = 0, 
the random variables / and dg are independent under p. Hence, 

E^fdg) = EM)E^g) = E }l {f)E }l {g) (5.4) 



whereby both conditions in Definition 5.1 follow. 
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For reflections through sites, we note that / and dg are independent conditional 
on Sp. Invoking the reflection symmetry of - |Sp), we get 

W*g\S P ) = E }l (f\S P )E }l (dg\Sp) = E tl (f\Sp)E }l (g\S P ) (5.5) 

Again the conditions of RP follow by inspection. □ 

A fundamental consequence of reflection positivity is the Cauchy-Schwarz in- 
equality 

2 (g#g) (5.6) 

Here is an enhanced, but extremely useful, version of this inequality: 

Lemma 5.3. Let ]i he RP with respect to d and let A, B, C K , D x € 2l + . Then 

[E fl (e A+m+L * Cc ' m '')] 2 

< [E pi (e A+t>A+ ^ c ^ c ")] [E F (e B+m+ ^ DM )] (5.7) 



Proof. Clearly, in the absence of the C a #D a terms, this simply reduces to \5.6) . To 
include these terms we use expansion into Taylor series: 

= Zh E £, i ((e A Q 1 ...C a „) i ?(e B D ai ...D a „)) (5.8) 

n>0 --An 



Now we apply 1 5.6 1 to the expectation on the right-hand side and then one more 
time to the resulting sum: 

E h((e A C Kl ...C an )#(e A C Kl ...C Kn )) l/2 



n>0 ' «!,...,«„ 

x E fl ( (e B D ai . . . D a „ ) (e B D ai . . . D a „ ) ) 1/2 
< fE^T E E fl ((e A C ai ...C an )He A C ai ...C an ))) 

\n>0 n - «l,...An ' 

x(Ei E E fl ((e B D Xl ...D an )V(e B D ai ...D a „))) 

\n>0 <n.\,-An ' 



1/2 



(5.9) 



Resummation via ( |5.8| now yields the desired expression. □ 

The argument we just saw yields a fundamental criterion for proving reflection 
positivity: 
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Corollary 5.4. Fix a plane of reflection P and let i? be the corresponding reflection operator. 
Suppose that the torus Hamiltonian takes the form 

- H L = A + #A + £ C a dC a (5.10) 

with A,C K € 2l + . Then for all f> > the torus Gibbs measure, is RP with respect 
tod. 



Proof. The proof is a simple modification of the argument in Lemma 5.3 Fix f ' ,g £ 
Expansion of the exponential term in T^ a Cu #C a yields 

i i L (5-11) 
^E^T E E^ Q ((/e^C ai ---C a J%e^C« a ---Qj) 

Z ' L n>0 n " Kl,-An 



The conditions of RP for p^a are now direct consequences of the fact that the prod- 
uct measure, ® is itself RP (cf Lemma |53) . □ 

Now we are ready to check that all 3 interactions that we focused our attention 
on in previous lectures are of the form in Lemma 5.3 and thus lead to RP torus 
Gibbs measures: 

Lemma 5.5. For any plane P, the n.n. (ferromagnet) interaction, Yukawa potentials and 
the power-law decaying potentials, the torus Hamiltonian can be written in the form (5.10 1 
for some A, C fl 6 

Proof. We focus on reflections through bonds; the case of reflections through sites 
is analogous. Given P, the terms in the Hamiltonian can naturally be decomposed 
into three groups: those between the sites in , those between the sites in T£" and 
those involving both halves of the torus: 



Ht = \ E & Sx-Sy + l E /S ) s * " s y + E E $ ^4° ( 5 - 12 ) 

s v ' N v ' yeTl 



A dA 



where we used the reflection symmetry of the ] xy to absorb the V2 into the sum at 



xy 

the cost of confining x to and y to . The first two terms indentify A and dA; 
it remains to show that the R ; -term can be written as yj a C a i?C a . We proceed on a 
case-by-case basis: 

Nearest-neighbor interactions: Here 
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R i = ^ E Si°sf (5.13) 

which is of the desired form since S y = &(S X ) whenever x and y contribute to the 
above sum. 

Yukawa potentials: We will only prove this in d = 1; the higher dimensions are 
harder but similar. Note that if P passes through the origin and x £ and y G T^, 



/i L ) = r V" p-f(M+lyl+»iO 

Hence, 



W = C E e-^l x l+l^+" L ) (5.14) 

)3>0 



R; = C £ e-''" L ( X] e-^l^ls4°) ( £ e^S^ ) (5.15) 

">0 xeT+ yGTl 



which is of the desired form. 

Power-law potentials: Here we note 

1 



\x-y\l Jo 



duu s - x e-v\ x -y^ (5.16) 



which reduces the problem to the Yukawa case. □ 

We remark that Corollary |5.3| allows a minor generalization: if a torus measure ]i 
is RP, and a torus Hamiltonian Hi takes the form (|5.10[), then also the measure e~P Hl dpi is 



RP. This may seem to be a useful tool for constructing RP measures; unfortunately 
we do not know any natural measures other than product measures for which RP 
can be shown directly. 

5.2 Gaussian domination 

Now we are in a position to start proving the infrared bound. First we introduce 
its integral version known under the name Gaussian domination: 

Theorem 5.6 (Gaussian domination). Let (J X y) be one of the three interactions above. 
Fix f> > and for h = {h x ) xeTh G {R v ) Tl define 

Z L (h):=E® F0 (ex P {-p £ & I s * " s y + h * ~ h y\ 2 }) ( 5 - 17 ) 

Then 

Z L {h) < Z L (0) (5.18) 
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Proof. Let Hi denote the sum in the exponent. It is easy to check that Hi is of the 
form 

-Hl = A + $B + Y^C ci &D a (5.19) 



Indeed, for h = this is simply Lemma |5.5| as the diagonal terms can always by 
absorbed into the a priori measure. To get h ^ we replace S x by S x + h x at each x. 
This changes the meaning of the original terms A and C K — and makes them dif- 
ferent on the two halves of the torus — but preserves the overall structure of the 
expression. 

A fundamental ingredient is provided by Lemma 5.3 which yields 

Z L (h) 2 < Z L (h + )Z L (h~) (5.20) 

where h + := h on and h + := dh on T^, and similarly for h_. Now let us show 
how this yields ( |5.18) : Noting that Zi(h) — > whenever any component of h tends 
to ±oo, the maximum of Zi(h) is achieved at some finite h. Let h* be a maximizer 
for which 

N(h):=#{(x,y):h x ^h y } (5.21) 

is the smallest among all maximizers. We claim that N(h*) = 0. Indeed, if N(h*) > 
then there exists a plane of reflection P through bonds such that P intersects at 
least one bond (x,y) with h* ^ h*. Observe that then 

min{N(/2*), N(h*_)} < N(h*) (5.22) 

Suppose without loss of generality that N(/j* ) < N(h*). Then the fact that h* was 
a maximizer implies 

Z L (^) 2 < Z L (h* + )Z L (Jt) < Z L (h%)Z L (h*) (5.23) 

which means 

Z L (h*) < Z L {h\) (5.24) 

i.e., h\ is also a maximizer. But that contradicts the choice of h* by which N(h*) 
was already minimal possible. It follows that N(h*) = 0, i.e., h* is a constant. 
Since Z(h + c) = Z(h) for any constant c, ( 5.18| follows. □ 

Now we can finally pay an old debt and prove the infrared bound: 
Proof of Theorem 3.3 To ease the notation, we will write throughout 

(7,0:= E VxCx (5-25) 

to denote the natural inner product on L 2 (Ti). First we note that for any (rj x ) S 
(R V ) T S 

E $\n*-ny\ 2 = (n.Gl x n) ( 5 - 26 ) 



x,yeT 



L 



Phase transitions in lattice models 43 

where Gi is as in ( 3.18| . (Indeed, in Fourier components, G^ 1 (/c) = 1 — J(k).) As is 
easy to check, 

Z L (/ J ) = E 0/(o (e^ s ^( S ^)> ) 

= Z L (0)E^(e-W h > G ? s >-K h > G ? h >) 

where is the torus Gibbs measure. The statement of Gaussian domination 
( 5.18 1 is thus equivalent to 

E^-W&s)) < e KKG?h) {5 28) 

We will now use invertibility of Gi to replace G^ l h by h. This yields 

E^e" 2 ^) <e^ h > G M whenever h x = (5.29) 

xeT L 

where the latter condition comes from the fact that G^ 1 annihilates constant func- 
tions. Next we expand both sides to quadratic order in h: 

1 - 2/$E Wj ,«fe,S» + ^E^iKS) 2 ) +... 

<l + P(h,G L h)+... (5.30) 

Since E^ L JS) is constant, E^ i „((/i,S)) = (h,E^ L JS)) =0 and we thus get 

E^((h,S) 2 )<±(h,G L h) (5.31) 

Finally, choose h x := v x ej, for some orthonormal basis vectors e, in R v . This singles 
out the z-th components of the spins on the left-hand side and has no noticeable 
effect on the right-hand side (beyond replacing vectors h x by scalars v x ). Summing 
the result over i = l,...,v we get the dot product of the spins on the left and an 
extra factor v on the right-hand side. □ 



5.3 Chessboard estimates 



The proof of the infrared bound was based on Lemma 5.3 which boils down to the 
Cauchy-Schwarz inequality for the inner product 

f,g^E pi (fVg) (5.32) 

In this section we will systematize the use of the Cauchy-Schwarz inequality to 
derive bounds on correlation functions. The key inequality — referred to as the 
chessboard estimate — will turn out to be useful in the proofs of phase coexistence 
in specific spin systems (even those to which the IRB technology does not apply). 
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Throughout we will restrict attention to reflections through planes of sites as 
this is somewhat more useful in applications (except for quantum systems). Pick 
two integers, B < L, such that B divides L and L /g is even. Fixing the origin of the 
torus, let Ag the block corresponding to {0, 1, . . . , B} d — i.e., the block of side B 
with lower-left corner at the origin. We may cover Ti by translates of Ag, 

T L = |J (Ag + Bt) (5.33) 

noting that the neighboring translates share the vertices on the adjacent sides. (This 
is the specific feature of the setup based on reflections through planes of sites.) The 
translates are indexed by the sites in a "factor torus" T L /g. 

Definition 5.7. A function f: Q Tl — » R is called a B-block function if it depends only 
on {S x : x G Ag}. An event A C Q Tl is called a B-block event if 1a is a B-block 
function. 

Given a B-block function /, and t G T L/B , we define # f / be the reflection of / 
"into" Ag + Bt. More precisely, for a self-avoiding path on T L / B connecting Ag 
to Ag + Bt, we may sequentially reflect / along the planes between the successive 
blocks in the path. The result is a function that depends only on {S x : x G Ag + Bt}. 
Due to the commutativity of the reflections, this function does not depend on the 
choice of the path, so we denote it simply by d t f. Note that since reflections are 
involutive, d 2 = id, there are only 2d distinct functions one can obtain from / 
modulo translations. 

Theorem 5.8 (Chessboard estimate). Suppose }i is RP with respect to all reflections 
between the neighboring blocks of the form Ag + Bt, t G T L/B . Then for any B-block 
functions f\, . . . ,f m , and any distinct t\,...,t m G T L / B , 



tn m r 

Mn<V/)<nM n h) 



' teT L/B 



(b/lY 



(5.34) 



Here is a version of this bound for events: If A\, . . . , A m are B-block events and 
t\,...,t m are distinct elements of Tl/b, then 



m m 



1 (B/L) 



d 



(5.35) 



where 

d t {A) := {Wa = 1} (5-36) 

Note that the exponent ( B /i) d is the reciprocal volume of the torus T L /g. (This is 
consistent with the fact that both expressions transform homogeneously under the 
scaling / ; - — > Ay/y with Ay > 0.) 
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Proof of Theorem 5.8 We will assume throughout that E }l (f df) = implies / = 0. 
(Otherwise, one has to factor out the ideal of such functions and work on the factor 
space.) We will first address the ID case; the general dimensions will be handled 
by induction. 

Abbreviate In := l /b and fix a collection of non-zero functions f\, . . . , fin - Define 
a multilinear functional F on the set of B-block functions by 



F(fi fin) := EJflVtft) (5.37) 



Gift fin) ■.= ^ n yj :rr' J :^ 



111 

n 

' f=l 

Noting that F(ft, • ••//;) > 0, we also define 

Hh fin) 

nj^iHfj f } )* 

These objects enjoy a natural cyclic invariance, 

F(fi,...,f 2n ) = F{f 2 n,fi,...,fm-i) (5.39) 

and, similarly, 

G(/i,...,/ 2 «) = G(/ 2 „, fx,..., fm-i) (5.40) 
The definition of G also implies 

G(f /) = 1 (5.41) 

Finally, Cauchy-Schwarz along the plane separating f\ from f 2n and /„ from f n+ \ 
yields 

G(fi,...,f 2 n) < G(/i,..., fnjn,-.., flf 2 

x G(f 2n ,..., f n+ i,f n+1 ,..., fmf 2 (5.42) 

This will of course be the core estimate of the proof. 
The desired claim will be proved if we show that 

G{f\, ■ ■ ■ , fm) < 1 (5.43) 

i.e., that G is maximized by 2n-tuples composed of the same function. We will 
proceed similarly as in the proof of Gaussian Domination: Given a 2n-tuple of B- 
block functions, [f\, . . . ,f 2n ), let (g\, . . . ,gm) be such that 

(!) Si e ifl' ■ ■ ■ ' fin } for each i = l,...,2n 

(2) G(gi,... ,g2n) maximizes G over all such choices of g\, . . .,g2n 

(3) gi, . . . ,g2n is minimal in the sense that it contains the longest block (counted 
periodically) of the form /;,/;, . . . ,fi, for some i € {1, . . . ,2n}. 
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Let k be the length of this block and, using the cyclic invariance, assume that the 
block occurs at the beginning of the sequence gi,... ,gi n , i-e., we have g\,---,gk = 
fj (with g] (+ i 7^ fj unless k = 2n). 

We claim that k = 2n. Indeed, in the opposite case, k < 2n, we must have 
gin 7^ fi whereby ( |5.42| | combined with the fact that {g\, . . . ,gi n ) is a maximizer 
of G imply 

G(gl, . . .,g2nf < G(gi, . . . ,gn,gn, ■ ■ ■ , gl)G{g 2n , ■ ■ ■ ,gn+\,gn+\, ■ ■ ■ ,gln) ^ ^ 
< G{gi,...,g n ,gn,...,gl)G{gi,...,g ln ) 

i.e., 

G{g!,...,g2n) < G(gi, gn/gn, ■ ■ ■ / gi) (5-45) 

This means that (gi,...,g n ,g n ,...,gi)is also a legitimate maximizer of G but it has 
a longer constant block — namely of length at least mm{2k,2n}. This is a contra- 
diction and so we must have k = 2n after all. In light of ( 5.41{ - 5T43] |, this proves the 
claim in d = 1. 

To extend the proof to d > 1, suppose that m = { l /bY and assume, without loss 
of generality, that we have one function f t for each block Ag + Bt. Writing 



n *tf t =fi( n it) (5-46) 

teT L/B ;=i V teT L/B / 



we may apply the ID chessboard estimate along the product over j. This homog- 
enizes the product over ft in the first coordinate direction. Proceeding through all 
directions we eventually obtain the desired claim. □ 

The chessboard estimate allows us to bound the probability of simultaneous 
occurrence of distinctly-placed B-block events in terms of their disseminated versions 
rifeT L/B $t(<A). The relevant quantities to estimate are thus 

/ \ (b/lY 



teTL/B 

The set function A i— > Il{A) is not generally additive. However, what matters for 
applications is that it is subadditive: 

Lemma 5.9 (Subadditivity). Let A and A\, A2, . . . , be a collection of B-block events 
such that 

A C |J A k (5.48) 

k 

Then 

dL(A) <£u(A k ) (5.49) 
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Proof. First we use the subadditivity of y. and ( 5.48| l to get 

u (A)\ T ^\ = u( n M*j) < Lv( fl MA t )) (5-50) 

Next we apply the chessboard estimate 

A n MAkS) < n (5-51) 

to each term on the right hand side. Finally we apply the distributive law for sums 
and products with the result 



(h) teT L/B 



\1l/b\ (5-52) 



= n E3l(a)= E3l(a: 

feT L/B fc V 

Taking the \Tl/b |"th root now yields the desired claim. □ 

Here is how subadditivity }i is generally used in computations: In order to 
estimate the 3L~value of an event, we first cover it by the union of a collection of 
smaller — and, as desired, easier to compute-with — events, then evaluate the 
3L~value for each of them and, finally, add the results. 

In estimates, we often work with the limiting version, 

i(A) := lim u{A) (5.53) 

L^oo 

of this quantity. We may interpret this as a partition function per site restricted to 
event A on each B-block. The advantage of taking the limit is that it often washes 
out some annoying finite-size factors and thus provides a more tractable expres- 
sion to work with. In addition, the limit can be computed using arbitrary — not 
just periodic — boundary conditions. 



5.4 Diagonal reflections, other lattices 

The above proof of the chessboard estimate is tailored to the underlying setting of 
the hypercubic lattice, primarily because of its use of the orthogonality between the 
principal lattice directions. However, some practical problems may lead us to the 
consideration of other lattices. Some cases generalize directly, e.g., certain instances 
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of the body-centered cubic (BCC) or face-centered cubic (FCC) lattices, whose unit cells 
look respectively as follows: 



Both of these are decorations of the cubic lattice in which an extra vertex placed in 
the center of each unit cube (BCC) or a face (FCC) and is attached by edges to the 
vertices in its ultimate vicinity. 

Assuming the interaction ( 2.15| l with / Xl/ non-zero and positive only for adja- 



cent (i.e., nearest-neighbor) pairs of vertices, the torus Gibbs measure is reflection 
positive for reflections both through and between the planes of sites of Z 3 . (A key 
observation is that the planes between sites of Z 3 contain some of the added ver- 
tices but bisect no additional edges.) The strengths of the interactions across the 
"old" and "new" edges may not even be the same. 

In d = 2, a corresponding graph is the lattice with a vertex placed in the middle 
of each square of Z 2 and edges from it to each of the four corners thereof. By the 
same reasoning, the nearest-neighbor ferromagnetic interaction leads to a reflec- 
tion positive torus Gibbs measure. 

The situation becomes more involved for the triangular (two-dimensional) lat- 
tice, whose standard embedding into the complex plane C has vertices 

m + ne iTT/3 , m,n E Z (5.54) 

and an edge between any pair of such vertices that differ by a number in the set 
{1, e 17r//3 , e l27r / 3 }. The principal problem with such graphs is how to place a finite 
piece of this lattice on a torus in a way that gives rise to reflection positive mea- 
sures. Here is a convenient choice: 
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with the torus obtained by identifying the vertices on the opposite sides. 

The allowed planes of reflection are all horizontal lines (reflections through 
sites) and the vertical lines (reflections through both sites and bonds). Again, for 
ferromagnetic nearest-neighbor interactions, the Gibbs measure with interaction 
( |2.15[ ) is reflection positive. A minor, though annoying, problem occurs in the ap- 
plication of chessboard estimates because the vertical lines of reflections actually 
cut through triangles. A solution is to focus only on those events that lie either on 
white or on gray triangles in the above picture and use reflection only with respect 
to vertical lines that do not cut through the chosen triangles. 

A completely analogous situation occurs for the honeycomb lattice. Here we con- 
sider the domain of the form 




and wrap it into a torus by identifying the vertices on the opposite side. Again, 
for nearest-neighbor ferromagnetic interactions, the resulting Gibbs measure is re- 
flection positive with respect to reflections in vertical lines on sites and horizontal 
lines between sites. In the application of chessboard estimates to a collection of 
"hexagon events," we only use every other horizontal and vertical reflections to a 
corresponding subset of these events; e.g., those sitting on the shaded hexagons. 

A final case of interest is that of diagonal reflections in Z d . In d = 2, this is 
achieved by wrapping the domain of the form 
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periodically into a torus. Reflections in the horizontal and vertical lines of sites — 
the diagonals — are now symmetries of this graph; for nearest-neighbor interac- 
tions (of any sign) the corresponding torus Gibbs measure is reflection positive. 

The advantage of the diagonal torus is that it permits the use of reflection pos- 
itivity on collections of "bond events/' i.e., those associated with pairs of nearest- 
neighbor spins. Subsequent applications of chessboard estimates disseminate a 
single-bond event over the entire torus. This, in turn, helps in estimates of the 
quantity $(A) whenever A is an event depending on a single square that is itself 
an intersection of bond events: 

Lemma 5.10. Given a unit cube in Z d , let Ay, with b running over all of the q := d2 d_1 
edges in this cube, be a collection of bond events. Then 

d(f] A b) <ri3(A) 1/Cd (5.55) 

\ b ' b 

Here $(Ab) is the partition function per site restricted to configurations such that Ab, or 
its corresponding reflection, occurs at all edges ofX d . 

Proof. Let us first focus on d = 2. The key fact is that the partition function per 
site, }{A), does not depend on what boundary conditions were used to define it. 
So, in order to compute 3 of the intersection event, we may first wrap the square 
lattice into the diagonal torus, and disseminate the bond events before passing to 
the L — ► 00. As there are C2 = 4 edges in each lattice square, there is an extra power 
of V 4 . 

In d > 2, we perform the same by singling out two lattice directions and wrap- 
ping the torus diagonally in these, and regularly in the remaining ones. This ho- 
mogenizes the event in two lattice directions. Proceeding by induction, the claim 
follows. □ 



5.5 Literature remarks 



The material of this section is entirely classical; a possible exception is Lemma 5.9 
which seems to have been formulated in the present form only relatively re- 
cently (13). The use of reflection positivity goes back to the days of construc- 
tive quantum field theory (namely, the Osterwalder-Schrader axioms (84j) where 
RP was a tool to obtain a sufficiently invariant — and natural — inner product. 
The use in statistical mechanics was initiated by the work of Frohlich, Simon and 
Spencer [50 1 (infrared bound) and Frohlich and Lieb |48| (chessboard estimates). 
The theory was further developed in two papers by Frohlich, Israel, Lieb and Si- 
mon 11461 |47| . There have been a couple of nice reviews of this material, e.g., by 
Shlosman [l95l| and in Georgii [j57|. 

All use of reflection positivity in these notes is restricted to one of the three inter- 
actions introduced in Chapter[3] Various generalizations beyond these are possible. 
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For instance, the n.n. interaction of strength / may be accompanied by a n.n.n. in- 
teraction of strength A — including negative values — and the result is still RP pro- 
vided / > 2(d — 1) |A|. For reflections through planes of sites, we may even allow 
any sort of interactions involving the spins in a given lattice cube. (This exhausts 
all finite range interactions; any longer range RP interactions are automatically in- 
finite range.) Many other examples are discussed, e.g., in 11461 page 32]. 

Notwithstanding our decision to restrict attention only to three specific inter- 
actions, the set of reflection positive interactions is not so small as it may appear. 
Indeed, in the class of translation and rotation invariant coupling constants, letting 

F(xi,...,x d ) := Jo,* (5.56) 

we check that a sufficient conditions for RP is that the matrix 

(x,y) ^ F(xi + y 1 ,x 2 -y 2/ ... / x d -y d )l {xi>0} l {n>0} (5.57) 



is positive semidefinite. (See ( |5.15 l for a specific case of this.) By Shur's Theorem 



— namely that if (fly) and (b y ) are positive semidefinite matrices, then so is (flnfe) 

— we thus know that if JW and j( 2 ' are two collections of RP couplings, then also 
the collection J™ Jw is RP. m particular, the set of RP couplings is closed under 
taking products. 

The situation on other lattices is discussed in J47|; the use of diagonal reflec- 
tions goes back to 1951 . We caution the reader that it is rather easy to make a mis- 
take in this context. For instance, the regularly-wrapped (L x L) torus in Z 2 is also 
symmetric with respect to all of the diagonal reflections. However, for diagonal re- 
flection on direct torus it is not possible to define two components of the "plane of 
reflection" so that the reflection in one leaves the other intact. So we cannot simul- 
taneously use both direct and diagonal reflections, and this prevents a direct proof 
of ( |5.55| in finite volume. (This error appeared in [17, eq. 4.39] though, as shown in 



Lemma 5.10 all differences get washed out in the thermodynamic limit.) 

Gaussian domination appears in a rather different context as the celebrated 
Brascamp-Lieb inequality. Consider the measure on R" of the form 

H(dx) := Z _1 e- v Wdx (5.58) 

with V smooth and strictly convex. Let V"(x) be the Hessian, i.e., annxn matrix 
of all second derivatives of V. Then for each smooth / with compact support, 

Mf) ~ (V) 2 < / dx((y")" 1 V/W, V/(*)) (5.59) 

where (-, •) denote the n-dimensional Euclidean inner product. In particular, if Q 
is a positive definite n x n matrix that dominates the Hessian from below at all x, 
then the correlations of u are dominated by those of the Gaussian measure with 
covariance 2Q _1 . This is, unfortunately, not very useful in the analysis of the Gibbs 
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measures for general lattice spin systems as these are generally not of the required 
form — e.g., because the restriction to a specific spin-space (a unit sphere for the 
Heisenberg model) cannot be approximated by convex functions. 



6 Applications of chessboard estimates 

In this section we will apply the technique of chessboard estimates to obtain proofs 
of phase coexistence in some lattice spin models. The arguments will be carried out 
in detail only for one rather simple example. For more sophisticated systems we 
present only the important ideas. Details, anyway, can be found in the correspond- 
ing papers. 



6.1 Gaussian double-well model 



Here we will demonstrate the use of chessboard estimates on the model of a Gaus- 
sian free-field model in a non-quadratic, double-well on-site potential. The Hamil- 
tonian takes the general form 

0H(0) := p £ (</>* - cp y ) 2 + £ V(cp x ) (6.1) 

(*,y) x 

where <p x G R with a priori measure given by the Lebesgue measure, and V is a 
potential. Note that f> has been incorporated into the Hamiltonian in such a way 
that the on-site potential remains independent of it. 

The most well known example of such systems is V((p) := j(p 2 withK > which 
is known as the massive Gaussian free field. This case can of course be treated com- 
pletely explicitly; e.g., on the torus the corresponding Gaussian measure on ((p x ) is 
zero-mean with covariance 

e ifc-(x-y) 

Cov(^,^)= £ (6.2) 

where D(k) is the Fourier transform of the torus (discrete) Laplacian, 

d 

D(k) := £|l-e^| 2 (6.3) 

7=1 



Note that the inclusion of the mass, k > — more precisely, k is the mass squared 
— makes the covariance regular even for the zero mode k = 0. 
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We will look at a modification of this case when V takes the form 




In fact, we will be even more specific and assume that V is simply given by 



+ e"z 



-f(#+i) 2 



(6.4) 



It is easy to check that, for k sufficiently large, V defined using this formula looks 
as in the figure. The reason for assuming (|6.4|| is the possibility of an Ising-spin 



representation. Indeed, we may rewrite (6.4 1 as 

£T=±1 (T=±l 

where C := e~ K . A product of such terms is thus proportional to 

This means we can write the Gibbs weight of the model as follows 

(x ^ e~^ ^<**> fa* - *y) 2- 2 0* e~ K Ex ^ 1<r * 
("*) 



(6.5) 



(6.6) 



(6.7) 



If we elevate (<7 X ) to genuine degrees of freedom, we get a model on spins S x := 
{(px,o~x) with a priori law Lebesgue on R x counting measure on {—1,1} and the 
Hamiltonian 



pH((p,<r) :=jS YL{<px-<Py) 2 + \YL<t>l + KYL<pxO- x 

(x,y) 



(6.8) 



Notice the first two terms on the right-hand side is the Hamiltonian of the massive 
(centered) Gaussian free field while the interaction between the <^'s and the cr's has 
on-site form. 

Here are some observations whose (simple) proof we leave to the reader: 
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Lemma 6.1. Let jibe a Gibbs measure for Hamiltonian ( |6.8| l and let v be its ^-marginal. 
Then v is a Gibbs measure for the Hamiltonian \6.1) subject to ( |6.4| l. The marginal v 
completely determines u: For any f depending only on <p and a in a finite set A, 

E }i {f) = E v ( £ /OM II eWW ) ( 6 - 9 ) 

(oi)*eA *e A 

We will use 0^ /K to denote the set of all Gibbs measures for the Hamiltonian 
( |6.8| l with parameters j5 and jc. The principal result for this model is as follows: 

Theorem 6.2. Let d > 2. For each e > there is c > sucfo £?za£ if k, K /p > c, then there 
exist }i + , u~ € <5« K which are translation invariant and obey 

H ± {a x = ±\)>\-e (6.10) 

and 

E^O^T-l) 2 ) <e (6.11) 

In simple terms, at low temperatures and large curvature of the wells of V, the 
fields prefer to localize in one of the wells. We remark that, while we chose the 
model as simple as possible, a similar conclusion would follow for with V given 
by 

e ~vfo) = e - K -±(<p-i) 2 +h + e -^(<p + \Y- h _ 

where h changes the relative weight of the two minima. Indeed, there exists h t 
at which one has two Gibbs measure — the analogues of }i + and u~. Moreover, 
if k + ^> K-, then h t > because, roughly speaking, the well at —1 offers "more 
room" for fluctuations. 

6.2 Proof of phase coexistence 



Here we will prove Theorem 6.2 We will focus on d = 2; the proof in general 
dimension is a straightforward, albeit more involved, generalization. 

Let us refer to a face of Z 2 as a plaquette (i.e., a plaquette is a square of side one 
with a vertex of Z 2 in each corner). Given a spin configuration (cr x ), we say that 
a plaquette is good if all four spins take the same value, and bad otherwise. Let B 
denote the event that the plaquette with lower-left corner at the origin is bad. 

Since the interaction is that of the GFF with a modified single-spin measure, the 
torus Gibbs measure is RP. The crux of the proof is to show that bad plaquettes are 
suppressed. Specifically, we want to show that 

2(B)<1 once j 6,k>1 (6.13) 



Appealing to the subadditivity lemma (Lemma |5.9| we only need to estimate the 
3-value of all possible configurations on the plaquette that constitute B. Due to the 
plus-minus symmetry of the c's, it suffices to examine three patterns: 
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+- X- +X ( 6 - 14 ) 

We begin with the most interesting of the three: 
Lemma 6.3. For any f>, k > 0, 

3(+l) < (6.15) 

Proof. Let Zl := X^r / e~^ HL< ^ ,cr ) YIxgt l d</>x be the torus partition function. Given a 
plaquette spin pattern, let Zl (pattern) denote the same object with a fixed to the 
disseminated pattern — the sole element of rifeT L ®t (pattern) . (We are working 
with B = 1.) By the definition of 3 we have 



3L(;tr':=^<^ (6-16) 
Zl Z l (XX) 

Now the partition function with all c's restricted to + is given by 



-f> Z{x, y ) (fv-<?,/) 2 - j Ex <P% e - K Ex <Px 
...)E GFF (e-*^) 



n 



(6.17) 



where the expectation is with respect to the massive Gaussian free field and the 
prefactor denotes the integral of the Gaussian kernel over all <p x . Similarly we ob- 
tain 

'iWe-*^- 1 )'* 1 ) (6.18) 



where we noticed that by disseminating the pattern ^Iwe obtain a configuration 
which is one at even parity x and minus on at odd parity x. Thus we conclude 

" E GFF (e-^xfx(-i)i^i) (6 - iy) 

i.e., we only need to compute the ratio of the Gaussian expectations, and not the 
prefactors. 

Next we recall a standard formula for Gaussian moment generating functions: 
If X is a multivariate Gaussian, then 

E(e A. X) = e A.EX+iVar(A.X) (g 2Q) 

Since Egff (</>*) = 0/ we only need to compute the (diagonal) matrix element of 
Cov((p x ,(py) against vectors 1 = (1,1,...) and ( — 1)1*1 However, a quick look at 
( |6.2| | will convince us that these functions are eigenvectors of the covariance ma- 
trix corresponding to k = and k = (71,71), respectively. Since D(0) =0 while 
D(n, 7t) = 8, we get 
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Var 



GFF 



Var GFF (X]fT(-l) |j| 



|Tl| 

8,6 + k 



(6.21) 



(6.22) 



where the factor |Tj,| is the (square of) the L 2 (TIY)-norm of the functions under 
consideration. Plugging this in ( |6.19| l we conclude 



|Tl| 



< exp<^ - \T l \k 1 



8/5 + K K 



(6.23) 



from which the claim readily follows. □ 
Next we attend to the other patterns: 

Lemma 6.4. For any f>, k > 0, 

3(: 

and 



2fa 



28k 

3 (-+) < e <9fe 



(6.24) 



(6.25) 



Proof. As for ( 6.241 1, dissemination of |l leads to alternating stripes of plusses and 
minuses, i.e., o~ x = ( — l)l Xl L Again, this is an eigenvector of the covariance matrix 
( |6.2| | with k = (71,0). The corresponding D equals 4 and so 



(6.26) 



yielding ( 6.24| |. 

The pattern ^ is more complex because its dissemination will not lead to an 
eigenvector of the covariance matrix. However, we circumvent this problem by 
noting that Lemma 5T0] implies 



3(;l)<3(;±) 1/2 3(n) 1/2 <3(;i) 1/2 



(6.27) 



6.3 



□ 



where we used 3 ^ 1- Now ( |6.25| l follows from Lemma 

Corollary 6.5. For each e > there exists a > such that if ft,K > a, then 3(6) < e. 

Proof. The event B can be written as the union over a finite number of bad patterns. 
On the basis of Lemmas 6.3 -6.4 the claim holds for B replaced by any fixed bad 
pattern. The desired bound now follows — with slightly worse constants — by 
invoking Lemma 5.9 □ 



Next we explain our focus on the bad event: 
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Lemma 6.6. There exists a constant c € (I/ 00 ) such that if c$(B) < V2 then for any 
x,y e T L , 

Hd<r x = l,<r y = -1) < 2ci{B). (6.28) 

Proo/. This is a consequence of a simple Peierls' estimate. Indeed, if o~ x = 1 and o~ y = 
— 1, then x is separated from yby a "circuit" of bad plaquettes. (Formally, either all 
plaquettes containing x are bad or there exists a non-trivial connected component 
of good — i.e., not bad — plaquettes containing x. This component cannot cover 
the whole torus because o~ y = —1; the above "circuit" is then comprised of the bad 
plaquettes on the boundary of this component.) This means that 

H (cr x = l,o- y = -1) < £>( P| d t {B)) < £m hl (6-29) 

7 te-y 7 

where |7| denotes the maximal number of disjoint bad plaquettes in 7 and where 
we used the chessboard estimates to derive the second bound. By standard argu- 
ments, the number of circuits of "length" n surrounding x or winding around Ti 
at least once is bounded by c n , for some constant c > 1. It follows 

u L {a x = l t (r y = -1) < £ c n i{B) n (6.30) 

Under the condition c$(B) < V2 this sum is less than twice its first term. □ 

Finally, we can assemble the ingredients into the desired proof of phase coexis- 
tence: 



Proof of Theorem 6.2 By symmetry of the torus measure, we have 

m(cr x = 1) = V 2 = m(<r x = -1). (6.31) 

Let z be a site at the back of the torus — that is distant at least W2 from the origin 
— and define 

:= H (-\(r z = ±l). (6.32) 

These measures satisfy the DLR condition with respect to any function that de- 
pends only on the "front" of the torus and so any weak cluster point of these mea- 
sures will be an infinite-volume Gibbs measure. Extract such measures by subse- 
quential limits and call them u + and u~ , respectively. 



We claim that u + 7^ u . Indeed, by Lemma 6.6 we have 



y+(<r x = -\) <2ci(B) (6.33) 

once 3(6) 1 and, by Corollary |6.5| this actually happens once f>, k ^> 1. Thus if, 
say, 2c3(£>) < V4, then ]ti(o~ x = —1) < V4 and, at the same time, Hi(cr x = +1) < 
V4. The same holds for the limiting objects and so u + 7^ u~ . Note that the measures 
can be averaged over shifts so that they become translation invariant. □ 
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Notice that in the last step of the proof we used, rather conveniently, the plus- 
minus symmetry of the torus measure. In the asymmetric cases, e.g., ( 6.12| , one 
can either invoke a continuity argument — choose h = hi such that ( 6.31| holds 
— or turn ( 6.28} into the proof that |Tl| -1 Y1xgt l will take values in [—1, —1 + 
e] U [1 — e, 1] with probability tending to one as L —* oo. The latter "forbidden- 
gap" argument is rather robust and extends, with appropriate modifications, to all 
shift-ergodic infinite-volume Gibbs measures. Hence, the empirical magnetization 
in ergodic measures cannot change continuously with h. 



To prove Theorem 6.2 it remains to show the concentration of the (p's around 
the cr's: 



Proof of ( |6.11 1. Let \i be a Gibbs measure for parameters /3 and k. Then (6^1 shows 
that, conditional on the cr's, the (p's are Gaussian with mean 



^{(pxW) = k((2jSA + k)-V 



and covariance (2/3A + k) 1 , where A is the lattice Laplacian. Now, once 
we may expand the inverse operator into a power series to get 



E H ((p X \0-) - <7; 



= E(t)*< av >' 

«>i 



(6.34) 

0/* < 1 



(6.35) 



which by the fact that \a z \ =1 is 0(P/ K ) independently of x. Since the conditional 
variance of <p x is 0( 1 / K ), we obtain 



E u (((p x - <7 x ) 2 \cr) <2E /( -E(<^|(7)] 2 |(7) + 2{E{Q x \o) - £T X )' 
= 0((/5/ K ) 2 )+0(V K ) 



(6.36) 



with the constants implicit in the O's independent of the cr's and x. Thus, if k 3> 1 
and 2> 1, then ( 6.11| l follows by the fact that }i ± put most of the mass on cr x = 
±1. □ 



6.3 Gradient fields with non-convex potential 

Having demonstrated the use of chessboard estimates on a toy model, we will 
proceed to discuss more complicated systems. We begin with an example which is 
somewhat similar to the Gaussian double-well model. 

A natural generalization of the massless GFF is obtained by replacing the 
quadratic gradient interaction by a general, even function of the gradients. The 
relevant Hamiltonian (again with temperature incorporated in it) is 



{x,y) 



(6.37) 
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The requirements that we generally put on V are continuity, evenness and quadratic 
growth at infinity. Under these conditions one can always define finite-volume 
Gibbs measures. 

As to the measures in infinite volume, the massless nature of the model may 
prevent existence of a meaningful thermodynamic limit in low dimensions; how- 
ever, if one restricts attention to gradient variables, 

Vb '■= <Py — <px if & is the oriented edge (x,y), (6.38) 

then the infinite-volume Gibbs measures exist, and may be characterized by a DLR 
condition, in all d > 1. We call these gradient Gibbs measures (GGM). A non-trivial 
feature of the GGM is that they obey a host of constraints. Indeed, almost every rj 
is such that 

Vh + Vh + Vh + Vh=0 (6.39) 

for any plaquette (b\, . . . , b$) with bonds listed (and oriented) in the counterclock- 
wise direction. 

Surprisingly, the classification of all possible translation-invariant, infinite-volume 
GGMs can be achieved under the condition that V is strictly convex: 

Theorem 6.7. Suppose V is convex, twice continuously differentiable with V" bounded 
away from zero and infinity. Then the shift-ergodic GGMs ji are in one-to-one correspon- 
dence with their tilt, which is a vector a G R d such that 

E ¥ (n b ) = a-b (6.40) 

for every (oriented) bond b (we regard b as a unit vector for this purpose). 

The word tilt comes from the interpretation of a as the slope or the incline of the 
interface whose height-gradient along bond b is given by rj},. The proof of this result 
— which is due to Funaki and Spohn — is based on the use of the Brascamp-Lieb 
inequality through which the convexity assumption enters in an essential way. It is 
also known that the large-scale fluctuation structure of the rj's is that of a Gaussian 
Free Field. 

A natural question to ask is what happens when V is not convex. Specific ex- 
amples of interest might be V taking the form of a double-well potential — kind of 
like for the Gaussian double-well model — or V's as in the figure: 



(a) 






(b) 


V(r,) 

u 




1 
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As it turns out, the double-well case is not quite tractable at the moment — and 
most likely behaves like a massless GFF on large scales — but the other two cases 
are within reach. We will focus on the case (a) and, as for the Gaussian double-well 
model, assume a particular form of the potential: 

e -v( v ) ._ v e -K Q f/2 + (i _ p ) e -Wn (6 41) 

where k q and k d are positive numbers and p S [0, 1] is a parameter to be varied. 
For this system one can prove the following result: 

Theorem 6.8. Suppose d = 2 and k q ^> k d . Then there is p t £ (0/1) and, for V 
with p = p t , there are two distinct, infinite-volume, shift-ergodic GGMs }i 0T & and ^dis 
that are invariant with respect to lattice rotations and have the following properties: 

(1 ) zero tilt: 

1 _^ 

rj b — ► 0, ji oldr ^dis-a.s. (6.42) 

b=(x,y) 



x,yeA L 



I Ail 

(2) distinct fluctuation size: 

The upshot of this result is that, once the convexity of V is strongly violated, the 



conclusions of Theorem 6.7 do not apply. While the example is restricted to d = 2, 
and to potentials of the form ( |6.41| |, generalizations to d > 2 and other potentials 
as in the above figure are possible and reasonably straightforward. 

Here are the main steps of the proof. First, as for the Gaussian double-well 
model, we use ( 6.41| | to expand the Gibbs weight according to whether the first 



or the second term in ( |6.41[ ) applies. This gives rise to a configuration of coupling 
strengths (kj,), one for each bond b, which take values in {k q , k d }. The joint Hamil- 
tonian of the rj's and the jc's is 

pH(n,K):=^V 2 b (6.44) 

b Z 

The joint measure is RP with respect to reflections through bonds and sites and, 
conditional on (kj,), the tj's are Gaussian. 

For the proof of phase coexistence, we focus on lattice plaquettes and divide 
these into good and bad according to whether all of the edges have the same cou- 
pling K or not. The dissemination of each bad patterns leads to a Gaussian integral 
but this time for GFF with inhomogeneous — yet periodically varying — cou- 
plings. For instance, the pattern with three bonds of type k q and one of type k d 
disseminates into periodic configuration where the edges on every other vertical 
line is of type k d and all other edges are of type k q . Similarly for all other bad 
patterns. 
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The periodic nature of the disseminated events allows the use of Fourier modes 
— i.e., pass to the reciprocal torus — to diagonalize the requisite covariance ma- 
trices. For instance, the aforementioned pattern with three k q 's arid one k d leads 
to a configuration which is periodic with period two. A calculation shows that the 
covariance is block diagonal with 2x2 blocks of the form 



'K |fl-| 2 + l{K + K D )\b-\ 2 l(K -K D )\b- 



2 



n ^ ' V 5(Ko-K d )|M 2 K |fl+| 2 + i(K + K D )|b_| 2 j 

where a± and b± are defined by 

a± = 1 ± e ih and b± = 1 ± e ik2 (6.46) 

with k := (k-[,k2) varying through one half of the reciprocal torus T* L . (The block 
combines the contribution of both k and k + zrei, and so we only need half of all 
k's.) The requisite Gaussian integral then reduces to rijceTr*\{o} [det n(ft:)] ~ 1/4 where 
in the exponent we get V4 instead of the expected V2 to account for double counting 
of the k's. To estimate the growth rate of this product, we note that 

n [detn(ic)]- 1/4 

kGT* L \{0} 

= exp|-|T L |l| ^l gdetn(/c)+o(|T L |)} (6.47) 

The integral plays the role of the free energy associated with the Gaussian vari- 
ables on the background of the specific periodic configuration of the k's. A similar 
expression — with different integrand — applies to each pattern. 

Comparing the integrals for all possible arrangements of the two types of bonds 
around a plaquette, we find that under the condition k q ^> k d , the bad patterns are 
heavily suppressed. Thus bad plaquettes are infrequent and can be regarded as 
parts of a contour. As it is not possible to pass from all-K G pattern to all-K D pattern 
without crossing a bad plaquette, the coexistence follows — as for the double-well 
model — by a standard Peierls' argument and chessboard estimates. Full details of 
the proof are to be found in a paper by Kotecky and the present author. 

The two-dimensional model has the special feature that we can actually com- 
pute p t : 



(6.48) 



Theorem 6.9. Let d = 2. If k o/ Kd » 1, then p t is given by 

1 — Pt Vk / 

This is a consequence of a duality relation that can be used to exchange the roles 
of k and k d . It is also interesting to note that, while the one-to-one correspondence 
between the Gibbs measures and their tilt is violated for non-convex potentials, the 
large-scale fluctuation structure remains that of a Gaussian Free Field. Indeed, we 
have: 
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Theorem 6.10. Let d = 2. For each translation-invariant, ergodic gradient Gibbs mea- 
sure }i with zero tilt, there exists a -positive-definite d x d matrix q = q(u) such that for 
any smooth f: R 2 — > R with compact support and f f(x)dx = 0, 

f dx<p [x/ei f(x) A/-(0,(/,Q-V)) (6-49) 

where J\f(0, C) denotes a normal random variable with mean zero and covariance C and Q 
is the elliptic operator 

t,;=l * / 

The basis of this result — derived in all d > 1 by Spohn and the present au- 
thor — is the fact that, conditional on the k's, the <p's are Gaussian with mean zero 
and covariance given by the inverse of the generator of a reversible random walk 
in random environment. The Gaussian limit is a consequence of an (annealed) in- 
variance principle for such random walks and some basic arguments in homoge- 
nization theory. The restriction to zero tilt appears crucially in the proof. 



6.4 Spin-waves vs infinite ground-state degeneracy 

Next we will discuss a couple of spin models whose distinctive feature is a high 
degeneracy of their ground state which is removed, at positive temperature, by 
soft-mode spin-wave fluctuations. The simplest example with such property is as 
follows: 

Orbital compass model: Here the spins on Z d take values in a unit sphere in R d , i.e., 
S x E S^ 1 with x G Z rf . The Hamiltonian is 

d 



H(S):=EE( S i a) - S H) 2 (6-5D 

X (1 = 1 



(a) 

where S x denotes the a-th Cartesian component of the spin and e a is the unit 
vector in the a-th coordinate direction. 

Despite a formal similarity with the Heisenberg model, note that only one com- 
ponent of the spin is coupled in each lattice direction. Notwithstanding, every con- 
stant configuration is still a minimum-energy state of ( 6.51| . Further ground states 



may be obtained from the constant ones by picking a coordinate direction a and 
changing the sign of the a-th component of all spins in some of the "lines" paral- 
lel with e a . In d = 2 these are all ground states but in d > 3 other operations are 
possible that preserve the minimum-energy property. 

The key question is now what happens with this huge ground-state degen- 
eracy at positive temperatures. Here is a theorem one can prove about the two- 
dimensional system: 
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Theorem 6.11. For each e > there exist )6o > and, for each f> > Bq, there exist two 
distinct, shift-ergodic Gibbs measures U\,U2 G such that 

E Fj (\S x -ej\)>l-e, j = l,2 (6.52) 

Moreover, for any u G <5pwe have 

E f( (S x )=0 (6.53) 

and there are no shift-ergodic u G ®p, B > Bo, for which we would have max ;= i /2 E^, [\S X ■ 
g y |)<l-e. 

The main idea underlying the proof is the evaluation of the free energy associ- 
ated with spin-wave perturbations of the constant ground states; it this expected 
that only the states with the largest contribution of these fluctuations survive at 
positive temperatures. Specifically we need to quantify the growth rate of the torus 
partition function with all spins constrained to lie within A of a given direction: 

Lemma 6.12. For each e > there is 5 > such that if B, A obey 

BA 2 > ^ and fiA 3 < 5 (6.54) 
then for every vq := (cos 9, sin 9) G S 1 , 

E« M (e-« S > n 1(|5,-..KA)) = (^) lV2 e-W>+.M] (6.55) 

where 

F{9):=^J ^^logjsin^^ll-e^p + cos^e)!!-^^! 2 } (6.56) 

The quantity F has the interpretation of the spin-wave free energy where the term 
"spin wave" refers to slowly varying deformations of a constant ground states. A 
convexity argument — based on the identity sin 2 (6?) + cos 2 (6?) = 1 — now shows 
that F is minimized by 9 = 0, n /2, n, 3n /2, i.e., exactly in one of the coordinate di- 
rections. This corroborates the intuition that only the configurations with most of 
the spins aligned in one of these directions will be relevant at low temperatures. 
However, to extract a proof of phase coexistence, we will have to again invoke a 
Peierls' argument. 

Fix k > and let A := /3~tz and B := log/3 and let Be and £>sw denote the 
following events: 

(1) Be ■= { a pair of neighboring spins in A B differ by an angle > A } 
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(2) £>gw is the set of configurations in the complement of Be in which the block Ag 
has all neighboring spins within A of each other with at least k>A from one 
of the four coordinate directions 

The event Be captures the situations when two neighboring spins are not quite 
close to each other leading to excess energy order A 2 . As a result of that, 



3 (i3 E ) < 3B 3 e~ C3 ^ 2 



(6.57) 



The event $sw collects the configurations where the energy is good but the fluctu- 



ations are not sufficiently powerful. The calculation in Lemma 6.12| and a simple 
use of the subadditivity lemma show 



3(#sw) < 



B 3 K 2 



(6.58) 



for some constants C\, Cj_ > 0. Thus, for our choices of A and B, once B >> 1 the den- 
sity of blocks where Be U B$w occurs in any typical configuration from the torus 
measure will be rather small. However, if a block is aligned in one coordinate di- 
rection and another block is aligned in a different direction, they must be separated 
by a "circuit" of bad blocks. Such circuits are improbable which leads to phase sep- 
aration. Details of these calculations — which extend even to quantum setting — 
can be found in a paper by Chayes, Starr and the present author. 

120-degree model: A somewhat more complicated version of the interaction, but 
with the spins S x taking values in the unit circle S 1 , can be contrived in d = 3. 
The Hamiltonian will actually look just as for the orbital compass model except 

(a) 

that S x are not Cartesian components but projections on the three third-roots of 
unity 61,62, £3 in S 1 . Explicitly, 

H ( S ) : =E E (S*-b lV -S 

1,2,3 



(6.59) 



X CL- 



Again, all constant configurations are ground states and further ground states 
may again be obtained by judicious reflections. Fortunately, the number of energy- 
preserving operations one can perform on ground states is much smaller than for 
the orbital compass model, and all ground states can thus be classified. Namely, 
given a ground state configuration, every unit cube in "Z? looks as one of the four 
cubes in the picture 
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modulo, of course, a simultaneous rotation of all spins. Here is what we can we 
say rigorously about this model: 

Theorem 6.13. Let w 1( . . . , w^ G S 1 be the six sixth roots of unity. For each e > there 
exist /So > and, for each /3 > fto, there exist six distinct, shift-ergodic Gibbs measures 
G <8p such that 

E n (S x -ftj)>l-e, ; = 1,...,6 (6.60) 

There are no shift-ergodic ji G <8p, f> > f$o,for which we would have max ;=1/ .. /6 £ F (S x • 
wy) < 1 — e. 

The ideas underlying this theorem are quite similar to the orbital compass 
model. First we find out that the spin-wave free energy for fluctuations about the 
ground state pointing in direction 9 is given by 



a=l,2,3 



(6.61) 



where q\ : = sin 2 (0), q 2 := sin 2 (0 - 120°) and q 3 := sin 2 (6? + 120°). A surprisingly 
sophisticated argument is then required to show that F is minimal only for 8 of 
the form jj, j = 1, . . . , 6. Once we have this information, the rest of the argument 
follows a route very similar to that for the orbital compass model (including the 
introduction of the scales k and A and the corresponding events <Be and £>sw)- 
Details appeared in a paper by Chayes, Nussinov and the present author. 

n.n. and n.n.n. antiferromagnet: Finally, we will consider a toy model that exempli- 
fies the features of both systems above. Here d = 2 and the spins take again values 
in S 1 , but the interaction is antiferromagnetic — that is, with a preference for an- 
tialignment — for both nearest and next-nearest neighbors: 

H ( S ) '■= lJ^[ S x ■ S x +e! + S x ■ S x+ e 2 ] +J^[S X - S x+ e 1+ e 2 + S x ■ S x+ gi-e 2 ] (6-62) 
x x 

Assuming |7| < 2, the minimum energy state is obtained by first enforcing the 
n.n.n. constraints — there is an antiferromagnetic, or Neel, order on both even and 
odd sublattice — and only then worrying about how to satiate the n.n. constraint. 
But once the sublattices are ordered antiferromagnetically the net interaction be- 
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tween the sublattices is zero — and so each of the sublattices can be rotated inde- 
pendently! Here is a configuration of this form: 



For this system we can nevertheless prove the following theorem: 

Theorem 6.14. For each e > there exist /3o > and, for each /3 > /3o, there exist two 
distinct, shift-ergodic Gibbs measures U\,]ii € &p such that 

- E }lj (S x ■ Sx+gjiez) > 1 - e (6.63) 

and 

E n (S x -S x+ ^) >l-e, 7 = 1,2 (6.64) 

There are no shift-ergodic u £ &p, /3 > f>Q,for which either ( 6.63} or at least one of ( 6.64 | 
does not hold. 

As for the two models above, everything boils down to a spin-wave calculation. 
Here the relevant parameter is the relative orientation 9 of the two antiferromag- 
netically ordered sublattices. The spin-wave free energy is then 



where 



D k (0) := ll-e^+^p + ll-e^ 1 -^! 2 

+ 7cos(0) (|1 - e ikl | 2 - |1 - e ikl | 2 ) (6.66) 
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As D(0) = «D(0) + (1 - «)D(7r), with a := |(1 + 005(0)), Jensen's inequality 
for the logarithm directly shows that F is minimized by 9 = or 9 = n. In spin 
configurations, the former corresponds to horizontal alignment and vertical an- 
tialignment of nearest neighbors, and the latter to horizontal antialignment and 
vertical alignment, i.e., stripe states. Details of all calculations appeared in a paper 
by Chayes, Kivelson and the present author. 

Notice that, despite the fact that the lattices maintain a specific relative orien- 
tation at low temperatures, a Mermin-Wagner argument ensures that every Gibbs 
measure is invariant under a rigid rotation of all spins. 



6.5 Literature remarks 

The Gaussian double-well model is a standard example which can be treated ei- 
ther by methods of reflection positivity or by Pirogov-Sinai theory fl35|. Represen- 



tations of the kind (6A\ have been used already before, e.g., by Kiilske [74l |75| 
and Zahradnik [107|. The method of proof presented here draws on the work of 
Dobrushin, Kotecky and Shlosman |33l 1711 |69| which was used to control order- 
disorder transitions in a number of systems; most notably, the ^-state Potts model 
with q 2> 1 |69| . These methods may be combined with graphical representations 
of Edwards-Sokal [39 1 (or Fortuin-Kasteleyn |43|) to establish rather complicated 
phase diagrams, e.g., |26llT2"f . Recently, the method has been used to resolve a con- 
troversy about a transition can occur in 2D non-linear vector models fl4"l~ll4"2| . 



Theorem 6.7 has been proved by Funaki and Spohn |53l| . As already mentioned, 
their proof is based on convexity properties of the potential V — by invoking the 
Brascamp-Lieb inequality as well as certain coupling argument to the natural dy- 
namical version of the model — and so it does not extend beyond the convex case. 
(A review of the gradient measures, and further intriguing results, can be found 
in Funaki [52], Velenik [104J or Sheffield |92J.) Theorem |6.8| was proved by Biskup 



and Kotecky [17]; Theorem 6.10 was derived by Biskup and Spohn 1181 ■ 



The interest in models in Sect. 6.4 came from a physics controversy about 
whether orbital ordering in transition-metal oxides exists at low temperatures. 
On the basis of rigorous work by Biskup, Chayes and Nussinov [13] (120-degree 
model) Biskup, Chayes, Nussinov and Starr [15] [HI (2D and 3D orbital compass 
model), it was demonstrated that, at least at the level of classical models, spin-wave 
fluctuations stabilize certain ground states II83II . The conclusions hold also the 2D 
quantum orbital-compass model with large quantum spins [15]. The mechanism 
of entropic stabilization — or, in physics jargon, order by disorder — is most clearly 
demonstrated in the n.n. & n.n.n. antiferromagnet studied by Biskup, Chayes and 
Kivelson fllT). This model actually goes back to the papers by Shender [93] and 
Henley II63II which first spelled out the original order-by-disorder physics argu- 
ments. 



All three "phase-coexistence" theorems in Sect. 6.4 have, apart from an exis 



tence clause, also a clause on the absence of ergodic states whose local properties 
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deviate from those whose existence was asserted. Actually, these were not the con- 
tent of the original work [l3j[Tll because, at that time, the focus on torus measures 
dictated by reflection positivity was deemed to make it impossible to rule out the 
occurrence of some exotic measures. A passage to such statements was opened by 



the work of Biskup and Kotecky [16]; the non-existence clauses in Theorems 6.11 



6.13 and 6.14 are direct consequences of the main result of IITBI and the method of 



proof of the existence part. This technique does not quite apply in the setting of 
gradient models due to the strong role the boundary conditions play in this case. 



7 Topics not covered 

There are naturally many interesting topics dealing with reflection positivity that 
have not been covered by these notes. Here we will attempt to at least provide 
a few relevant comments and give pointers to the literature where an interested 
reader may explore the subject to the desired level of detail. 

The first (and large) area which was neglected is that of quantum models. Here 
one faces the principal difficulty that the spin variables are replaced by oper- 
ators which, generally, do not commute with one another. Nevertheless, reflec- 
tion positivity can be proved for reflections through planes between sites under 
the condition that the Hamiltonian is of the form ( |5.10 |. (For reflections through 



planes of sites the non-commutativity of involved objects makes the above tech- 
nology largely unavailable.) Thus, chessboard estimates and, by a passage via the 
Duhamel two-point function, also infrared bound can again be established. This 
and the resulting applications to proofs of phase transitions in, e.g., the quantum 
Heisenberg anh'ferromagnet and XY-model constitute the papers of Dyson, Lieb 
and Simon 1138 1 and Frohlich and Lieb [|48| . A pedagogical account of these can be 
found in the notes by Toth fl!03| . 

Unlike for the classical models, in the quantum setting reflection positivity ap- 
pears to be a somewhat peculiar condition. Generally, it requires that the involved 
operators can be represented by either real or purely imaginary matrices. This 
is where the technique fails in the case of the quantum Heisenberg ferromagnet 
(Speer H99l , but see also Kennedy [67 \ and Conlon and Solovej [27]). Notwithstand- 
ing, the technique continued to be applied in the quantum world to derive useful 
conclusions; e.g., to study long range order in two-dimensional antiferromagnets 
(Kennedy, Lieb and Shastry |68|), to resolve the so called flux phase problem in 
the Hubbard model (Lieb [78], see also Macris and Nachtergaele |5T|) or to prove 
uniqueness of the ground state in the half-filled band therein (Lieb l|77P. The latter 
work invokes spin-reflection positivity; a new idea later further exploited by, e.g., 
Tian [ 102 1 and Tasaki [|101| . Other applications of reflection positivity in itinerant- 
electron models appear in, e.g., Macris [79] and Macris and Lebowitz [80]. 

As already mentioned, one can use RP to develop a rigorous link between the 
phase transitions in quantum and classical systems (Biskup, Chayes and Starr [15]). 
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Here the main idea is the conversion of the quantum chessboard estimate to the 
classical one by means of an extension of Berezin-Lieb inequalitites to matrix ele- 
ments in the basis of coherent states. 

Another topic not sufficiently represented in these notes is that of dimer or other 
combinatorial models. Here we wish to mention, e.g., the conclusions concern- 
ing the six-vertex model and hard-core lattice gasses (Frohlich, Israel, Lieb and 
Simon (47l) or the liquid-crystal models based on interacting dimers (Heilmann 
and Lieb [62J and Abraham and Heilmann [lj). There is also a novel application to 
characterization of graph homomorphisms (Freedman, Lovasz and Schrijver [44 j). 

The origin of reflection positivity lies within the field theory as part of the 
Osterwalder-Schrader axioms. A reader interested in this direction should employ 
the relevant search outlets to explore the literature on the subject. For statistical 
mechanics, interesting applications come in the proofs of phase transitions in Eu- 
clidean field theories, e.g., that of quark confinement (Borgs and Seiler |20]P or 
chiral symmetry breaking (Salmhofer and Seiler H9T1 ) in gauge theories. 

Finally, there is the recent clever application of chessboard estimates to control 
the rigidity of Dobrushin interfaces in the Ising (and some other) three dimensional 
models (Shlosman and Vignaud [96 j). This direction will likely be further exploited 
to study interface states in continuum-spin systems. 

8 Three open problems 

We finish with a brief discussion of three general open problems of the subject cov- 
ered by these notes which the present author finds worthy of significant research 
effort. 

In Chapters [3] and |4] we have shown how useful the infrared bound is in proofs 
of symmetry breaking and control of the mean-field approximation. Unfortunately, 
the only way we currently have for proving the IRB is reflection positivity. So our 
first problem is: 

Problem 8.1. Consider models with the Hamiltonian H = — Y<{x,y) $x • Sy- Prove 
the IRB directly without appeal to RP. 

As already mentioned, a successful attempt in this direction has been made by 
Sakai |90l[ , who managed to apply the lace expansion to a modified random cur- 
rent representation of the Ising model. However, here we have in mind something 
perhaps more robust which addresses directly the principal reason why we need 
RP, which is that 

the spins (S x ) are not a priori independent Gaussian 

Among approaches in this direction is the spherical approximation for the 0{n) 
model, in which the constraint \S X \ = 1 at every spin is replaced by a constraint 
on £ x IS* | 2 . 
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The IRB is often viewed as a rigorous version of spin-wave theory. This theory, 
initiated in the work of Dyson [37| and others, describes continuous deformations 
of the lowest energy states by means of an appropriate Gaussian field theory. In 
Chapter [6] we saw that chessboard estimates may be applied in conjunction with 
spin-wave calculations — which are generally deemed to be the realm of the IRB 
— to prove phase transitions. This was possible because spin-waves disqualified 
all but a finite number of ground states from candidacy for low-temperature states. 
Notwithstanding, one might be able to do the same even in the presence of in- 
finitely many low-temperature states: 

Problem 8.2. Prove symmetry breaking at low temperatures in systems with con- 
tinuous internal symmetry — e.g., the 0(2)-model — without the use of the IRB. 
Chessboard estimates are allowed. 

An interesting resource for thinking about this problem may be the paper of 
Bricmont and Fontaine ||2T| . 

Further motivation to look at this problem comes from quantum theory: The 
quantum Heisenberg ferromagnet is not RP (see Speer 199 J) and so there is no proof 
of the IRB and, consequently, no proof of low-temperature symmetry breaking. 
On the other hand, the classical Heisenberg ferromagnet is RP and so the spin- 
condensation argument applies. However, if we had a more robust proof of sym- 
metry breaking in the classical model, e.g., using chessboard estimates, one might 
hope to extend the techniques of Biskup, Chayes and Starr Ifl5| to include also the 
quantum system. 

While the theory described in these notes is not restricted exclusively to ferro- 
magnetic systems, in order to have the IRB one needs a good deal of attractivity in 
the system. It is actually clear that the IRB cannot hold as stated for antiferromag- 
nets, e.g., hard core lattice gas, which is a model with variables n x £ {0/1} and the 
"Gibbs" weight proportional to 

\E x ti x Y\(l- n x n y ), (8.1) 

or the ^-state Potts antiferromagnet, which is the model in ( |2.7| | with / < 0. Indeed, 
the staggered long-range order, which is known to occur in the hard core lattice gas 
once A > 1, implies that the macroscopically occupied mode is k = (tc,. . . ,n) 
rather than k = 0. Nevertheless, we hope that some progress can be made and so 
we pose: 

Problem 8.3. Derive a version of the IRB for the hard-core lattice gas and/ or the 
^-state Potts antiferromagnet at zero temperature. 

Solving this problem would, hopefully, also provide an easier passage to the 
proof that the critical A for the appearance of staggered order tends to zero 
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as d — > oo — in fact, if the mean-field theory is right then one should have A c ~ C U 
— and that the 3-coloring of Z 2 exhibits six distinct extremal measures of maxi- 
mal entropy. These results have recently been obtained by sophisticated contour- 
counting arguments H5HI55I . 
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